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Abstract 

The role of improved sehooling, a central part of most development strategies, has become 
controversial because expansion of school attainment has not guaranteed improved 
economic conditions. This paper reviews the role of education in promoting economic well- 
being, with a partieular foeus on the role of edueational quality. It eoneludes that there is 
strong evidenee that the eognitive skills of the population - rather than mere sehool 
attainment - are powerfully related to individual earnings, to the distribution of income, 
and to economic growth. New empirical results show the importance of both minimal and 
high level skills, the eomplementarity of skills and the quality of eeonomic institutions, and 
the robustness of the relationship between skills and growth. International eomparisons 
incorporating expanded data on cognitive skills reveal mueh larger skill defieits in 
developing countries than generally derived from just sehool enrollment and attainment. 
The magnitude of ehange needed makes clear that closing the eeonomic gap with 
developed eountries will require major structural changes in sehooling institutions. 
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1. Introduction 



It takes little analysis to see that education levels differ dramatically between developing and 
developed countries. Building upon several decades of thought about human capital - and centuries of 
general attention to education in the more advanced countries - it is natural to believe that a productive 
development strategy would be to raise the schooling levels of the population. And, indeed, this is 
exactly the approach of the Education for All initiative and a central element of the Millennium 
Development Goals. 

But there are also some nagging uncertainties that exist with this strategy. First, developed and 
developing countries differ in a myriad of ways other than schooling levels. Second, a number of 
countries - both on their own and with the assistance of others - have expanded schooling 
opportunities without seeing any dramatic catch-up with developed countries in terms of economic 
well-being. Third, countries that do not function well in general might not be more able to mount 
effective education programs than they are to pursue other societal goals. Fourth, even when schooling 
policy is made a focal point, many of the approaches undertaken do not seem very effective and do not 
lead to the anticipated student outcomes. In sum, is it obvious that education is the driving force, or 
merely one of several factors that are correlated with more fundamental development forces? 

The objective of this study is to review what is known about the role of education in promoting 
economic well-being. We are interested in assessing what research says about these issues. More than 
that, we pay particular attention to the credibility of this research in establishing a causal relationship 
between education and economic outcomes and between policy initiatives and educational outcomes. 

The discussion also has one distinctive element. We have come to conclude that educational 
quality - particularly in assessing policies related to developing countries - is THE key issue. It is both 
conventional and convenient in policy discussions to concentrate on such things as years of school 
attainment or enrollment rates in schools. These things are readily observed and measured. They appear 
in administrative data, and they are published on a consistent basis in virtually all countries of the 
world. And, they are very misleading in the policy debates. 

We will show in graphic terms the differences in educational quality that exist. Most people would, 
in casual conversation, acknowledge that a year of schooling in a school in a Brazilian Amazon village 
was not the same as a year of schooling in a school in Belgium. They would also agree that families, 
peers, and others contribute to education. Yet, research on the economic impact of schools - largely 
due to expedience - almost uniformly ignores these. The data suggest that the casual conversation may 
actually tend to understate the magnitude of differences. 
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We will also provide strong evidence that ignoring quality differences significantly distorts the 
picture about the relationship between education and economic outcomes. This distortion occurs at 
three levels. It misses important differences between education and skills on the one hand and 
individual earnings on the other. It misses an important underlying factor determining the interpersonal 
distribution of incomes across societies. And, it very significantly misses the important element of 
education in economic growth. 

The plan of this study is straightforward. We begin by documenting the importance of cognitive 
skills - the measure of educational quality we use - in determining individual earnings, and by 
implication important aspects of the income distribution. We then turn to the relationship of education 
and economic growth. Research into the economics of growth has itself been a growth area, but much 
of the research focuses just on school attainment with no consideration of quality differences or of 
other sources of learning. We show, in part with new evidence, that the evidence is highly biased by its 
concentration on just quantity of schooling. 

In both of these areas, attention has been given to causality; i.e., is it reasonable to believe that 
changing education would directly lead to a change in economic outcomes? Again, the concentration 
on quantity of schooling has distorted these discussions of causality, and consideration of quality 
considerably alters the issues and implications. 

The simple answers in the discussion of economic implications of education are that educational 
quality, measured by cognitive skills, has a strong impact on individual earnings. More than that, 
however, educational quality has a strong and robust influence on economic growth. In both areas, 
there is credible evidence that these are truly causal relationships. 

To be sure, none of this says that schools per se are the answer. Even though it is common to treat 
education and schooling synonymously, it is important to distinguish between knowledge and skills on 
the one hand (educational quality in our terminology) and schooling. This semantic distinction has 
important substantive underpinnings. Cognitive skills may be developed in formal schooling, but they 
may also come from the family, the peers, the culture, and so forth. Moreover, other factors obviously 
have an important impact on earnings and growth. For example, overall economic institutions - a well- 
defined system of property rights, the openness of the economy, the security of the nation - can be 
viewed almost as preconditions to economic development. And, without them, education and skills 
may not have the desired impact on economic outcomes. 

Yet, while recognizing the impact of these overall institutions, we find that schools can play an 
important role. Quality schools can lead to improved educational outcomes. Moreover, from a public 
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policy perspective, interventions in the schools are generally viewed as both more acceptable and more 
likely to succeed than, say, direct interventions in the family. 

Given the evidence on the importance of educational quality for economic outcomes, the study 
turns to important policy issues. To begin with, what can be said about the educational quality and 
cognitive skills in developing countries? Although information on enrollment and attainment has been 
fairly widely available, quality information has not. We use newly developed data on international 
comparisons of cognitive skills (also employed in the analysis of growth) to show that the education 
deficits in developing countries are larger than previously appreciated. 

Discussions of quality inevitably lead to questions about whether it can be affected by policy. An 
extensive literature, albeit one biased toward developed countries, now exists on a number of policy 
issues. Perhaps most well known is that simply putting more resources into schools - pure spending, 
reduced class sizes, increased teacher training, and the like - will not reliably lead to improvements in 
student outcomes. 

These findings are, however, often misinterpreted. First, they do not imply that schools have no 
effect. They say simply that common measures of school quality are in reality not closely related to 
student outcomes, but this is not the same as finding that school quality differences do not exist. 
Second, the findings do not say that spending and resources never matter. Indeed, there is some 
indication, particularly in developing countries, that a range of resources are important - textbooks, 
rudimentary facilities, and the like. The potential impacts of these are nonetheless too small to be 
instruments for radical changes in outcomes, something that the prior evidence indicates is needed in 
many developing countries. Third, the findings do not say that resources cannot matter. They indicate 
that resources may not have any consistent effects within the current structure and institutions of 
schools, but the findings do not put resource discussions into the context of alternative structures. 

One consistent finding that is emerging from research, albeit largely from developed country 
experiences, is that teacher quality has powerful impacts on student outcomes. The problem from a 
policy aspect is, however, that quality differences are not closely related to the common measures of 
quality and to the common policy instruments that are employed. Within countries where the data exist, 
there is little indication that quality is closely related to teacher education and training, teacher 
experience, teacher certification, or teacher salaries. These facts disrupt the policy discussions. They 
also make it clear that different sets of policies must be contemplated if schools are to improve. 

A different view of schools, however, concentrates on larger institutional issues. There is growing 
evidence that a number of devices - things that effectively change the existing incentives in schools - 
have an impact. Accountability systems based upon tests of student cognitive achievement can change 
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the incentives for both school personnel and for students. By focusing attention on the true policy goal 
- instead of imperfect proxies based on inputs to schools - performance can be improved. These 
systems align rewards with outcomes. Moreover, increased local decision making or local autonomy, 
coupled with accountability, can facilitate these improvements. 

The evidence on a set of larger, and potentially more powerful, policy changes is relatively limited 
at the current time. There is suggestive evidence that greater school choice promotes better 
performance. Further, direct incentives to teachers and school personnel in the form of performance 
pay have promise. Unfortunately, however, these policies can lead to substantial changes in the 
incentives within schools, and such substantial changes are frequently resisted by current school 
personnel. Current employees, often through their unions, generally tend to resist and to stop even 
experimentation with such changes. Thus, direct evidence on them is more limited, and may require 
more inferences. Nonetheless, there remains reason to believe that pursuing these larger changes could 
lead to the substantial improvements in outcomes that are desired or hoped for in the policy process. 
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2. Individual Returns to Education and Economic Inequality 

2.1 Impacts of School Attainment on Individual Incomes 

Most attention to the value of schooling focuses on the economic returns to differing levels of 
school attainment for individuals. This work, following the innovative analyses of human capital by 
Jacob Mincer (1970, 1974), considers how investing in differing amounts of schooling affects 
individual earnings. Over the past thirty years, literally hundreds of such studies have been conducted 
around the world. ^ 

These studies have uniformly shown that more schooling is associated with higher individual 
earnings. The rate of return to schooling across countries is centered at about 10 percent with variations 
in expected ways based largely on scarcity: returns appear higher for low income countries, for lower 
levels of schooling, and, frequently, for women (Psacharopoulos and Patrinos (2004)). 

Much of the academic debate has focused on whether these simple estimates provide credible 
measures of the causal effect of schooling. In particular, if more able people tend also to obtain 
additional schooling, the estimated schooling effect could include both the impacts of schooling and the 
fact that those continuing in school could earn more in the absence of schooling.^ For the most part, 
employing alternative estimation approaches dealing with the problems of endogeneity of schooling do 
not lead to large changes in the estimates, and many times they suggest that the returns are actually 
larger with the alternative estimation schemes than with the simpler modeling strategies. 

The basic estimates of Mincer earnings models are typically interpreted as the private returns to 
schooling. As is well known, the social returns could differ from the private returns - and could be 
either above or below the private returns. The most common argument is that the social returns will 
exceed the private returns because of the positive effects of education on crime, health, fertility. 



* A variety of studies review and interpret the basic estimation of rates of return. See Psacharopoulos (1994), Card 
(1999), Harmon, Oosterbeek, and Walker (2003), Psacharopoulos and Patrinos (2004), and Heckman, Lochner, and Todd 
(2006). 

^ Harmon, Oosterbeek, and Walker (2003) systematically review the various issues and analytical approaches dealing 
with them along with providing a set of consistent estimates of returns (largely for OECD countries). They conclude that, 
while the estimation approaches can have an impact on the precise value of the rate of return, it is clear that there is a strong 
causal impact of school attainment on earnings. 
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improved citizen participation,^ and (as we discuss below) on growth and productivity of the economy 
as a whole/ If on the other hand schooling was more of a selection device than of a means of boosting 
knowledge and skills of individuals, the social return could be below the private return/ Although there 
are many uncertainties about precisely how social returns might differ from private returns, there is 
overall little reason to believe that the social returns are less than the private returns, and there are a 
variety of reasons to believe that they could be noticeably higher. 

2.2 Impacts of Educational Quality on Individual Incomes — Developed Countries 

The concentration on school attainment in the academic literature, however, contrasts with much 
of the policy discussion that, even in the poorest areas, involves elements of “quality” of schooling. 
Most countries are involved in policy debates about the improvement of their schools. These debates, 
often phrased in terms of such things as teacher salaries or class sizes, rest on a presumption that there 
is a high rate of return to schools in general and to quality in particular. 

But it is not appropriate simply to presume that any spending on schools is a productive investment 
that will see the returns estimated for attainment. It is instead necessary to ascertain two things: how 
various investments translate into quality and how that quality relates to economic returns. This section 
provides a summary of what is known about the individual returns to educational quality in both 
developed and developing countries. 

One of the challenges to understanding the impact of quality differences in human capital has been 
simply knowing how to measure quality. Much of the discussion of quality — in part related to new 
efforts to provide better accountability — has identified cognitive skills as the important dimension. 



^ Recent studies indeed find evidence of externalities of education in such areas as reduced crime (Lochner and Moretti 
(2004)), improved health of children (Currie and Moretti (2003)), and improved civic participation (Dee (2004); Milligan, 
Moretti, and Oreopoulos (2004)). The evidence on direct production spillovers of education among workers is more mixed, 
with Moretti (2004) and the studies cited therein finding favorable evidence and Acemoglu and Angrist (2000) and Ciccone 
and Peri (2006) finding no evidence for this kind of spillovers. 

* In the Mincer earnings work, social rates of return are frequently calculated. These calculations are not based on the 
positive externalities cited but instead on the fact that the social cost of subsidized education exceeds the private costs - thus 
lowering the social rate of return relative to the private rate of return (see Psacharopoulos and Patrinos (2004)). 

^ The empirical analysis of these issues has been very difficult because the labor market outcomes of the 
screening/selection model and the productivity/human capital model are very similar if not identical. Lange and Topel 
(2006) review the theory and empirical work and conclude that there is little evidence that the social rate of return to 
schooling is below the private rate of return. 
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And, while there is ongoing debate about the testing and measurement of these skills, most parents and 
policy makers alike accept the notion that cognitive skills are a key dimension of schooling outcomes. 
The question is whether this proxy for school quality — students’ performance on standardized tests — is 
correlated with individuals’ performance in the labor market and the economy’s ability to grow. 

Until fairly recently, little comprehensive data have been available to show any relationship 
between differences in cognitive skills and any related economic outcomes. The many analyses of 
school attainment and Mincer earnings functions rely upon readily available data from censuses and 
other surveys, which find it easy to collect information on earnings, school attainment, age, and other 
demographic information. On the other hand, it is difficult to obtain data on cognitive skills along with 
earnings and the other determinants of wages. Although cognitive test and school resource data are 
increasingly available at the time of schooling, these are seldom linked to subsequent labor market 
information. Such analyses generally require tracking individuals over time, a much more difficult data 
collection scheme. Such data are, however, now becoming available. 

A variety of researchers are now able to document that the earnings advantages to higher 
achievement on standardized tests are quite substantial.'’ While these analyses emphasize different 
aspects of individual earnings, they typically find that measured achievement has a clear impact on 
earnings after allowing for differences in the quantity of schooling, the experiences of workers, and 
other factors that might also influence earnings. In other words, higher quality as measured by tests 
similar to those currently being used in accountability systems around the world is closely related to 
individual productivity and earnings. 

Three recent U.S. studies provide direct and quite consistent estimates of the impact of test 
performance on earnings (Mulligan (1999); Murnane, Willett, Duhaldeborde, and Tyler (2000); Lazear 
(2003)). These studies employ different nationally representative data sets that follow students after 
they leave school and enter the labor force. When scores are standardized, they suggest that one 



“ These results are derived from different specific approaches, but the basic underlying analysis involves estimating a 
standard “Mincer” earnings function and adding a measure of individual cognitive skills. This approach relates the 
logarithm of earnings to years of schooling, experience, and other factors that might yield individual earnings differences. 
The clearest analyses are found in the following references for the U.S. (which are analyzed in Hanushek (2002b)). See 
Bishop (1989, 1991); O'Neill (1990); Grogger and Eide (1993); Blackburn and Neumark (1993, 1995); Mumane, Willett, 
and Levy (1995); Neal and Johnson (1996); Mulligan (1999); Murnane, Willett, Duhaldeborde, and Tyler (2000); Altonji 
and Pierret (2001); Murnane, Willett, Braatz, and Duhaldeborde (2001); and Lazear (2003). 
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standard deviation increase in mathematics performance at the end of high schools translates into 12 
percent higher annual earnings/ 

Mumane, Willett, Duhaldeborde, and Tyler (2000) provide evidence from the High School and 
Beyond and the National Longitudinal Survey of the High School Class of 1972. Their estimates 
suggest some variation with males obtaining a 15 percent increase and females a 10 percent increase 
per standard deviation of test performance. Lazear (2003), relying on a somewhat younger sample from 
NELS88, provides a single estimate of 12 percent. These estimates are also very close to those in 

o 

Mulligan (1999), who finds 11 percent for the normali z ed AFQT score in the NLSY data. Note that 
these returns can be thought of as how much earnings would increase with higher quality each and 
every year throughout the persons’ working career. Thus, the present value of the returns to higher 
quality is large. 

These estimates are obtained fairly early in the work career (mid- 20s to early 30s), and analyses of 
the impact of cognitive skills across the entire work life are more limited. Altonji and Pierret (2001) 
find that the impact of achievement on earnings grows with experience, because the employer has a 
chance to observe the performance of workers. The pattern of how returns change with age from their 
analysis is shown in Figure 2.1, where the power of school attainment differences to predict differences 
in earnings is replaced by cognitive skills as workers are in the labor force longer. The evidence is 
consistent with employers relying on readily available information on school attainment when they do 
not have other information and switching to observations of skills and performance as that information 
becomes available through job performance.^ On the other hand, Hanushek and Zhang (2006) do not 
find that this pattern holds for a wider set of countries (although it continues to hold for the United 



Because the units of measurement differ across tests, it is convenient to convert test scores into measures of the 
distribution of achievement across the population. A one-half standard deviation change would move somebody from the 
middle of the distribution (the 50* percentile) to the 69* percentile; a one standard deviation change would move this 
person to the 84* percentile. Because tests tend to follow a bell-shaped distribution, the percentile movements are largest at 
the center of the distribution. 

® By way of comparison, we noted that estimates of the value of an additional year of school attainment are typically 
about 10 percent. Of course, any investment decisions must recognize that quality and quantity are generally produced 
together and that costs of changing each must be taken into account. 

® Note that Altonji and Pierret (2001) observe a limited age range, so that these changing returns may well be thought 
of as leveling off after some amount of labor market experience. 



8 




States in their data). Thus, there is some uneertainty eurrently about whether eognitive skills have 
differential effeets on eeonomie outeomes over the work-experienee profile. 



Figure 2.1: Returns to Observed Educational Quantity and Unobserved Educational Quality 
over the Work Life 




Work experience fvears) 

Notes; Based on data from National Longitudinal Survey of Youth (NLSY) and Armed Forces Qualification Test (AFQT). 
SD = standard deviation. 

Source: Based on Altonji and Pierret (2001). 

There are other reasons to believe that these estimates provide a lower bound on the impact of 
higher achievement. First, the labor market experiences that are observed begin in the mid-1980s and 
extend into the mid-1990s, but other evidence suggests that the value of skills and of schooling has 
grown throughout and past that period. Second, extrapolating from recent patterns, future general 
improvements in productivity are likely to lead to larger returns to skill." Finally, cognitive skills 

*”Hanushek and Zhang (2006), discussed below, do find that the importance of cognitive skills is not restricted just to 
younger workers (represented in most prior research) but holds across the experience spectrum. 

" The considered analyses typically compare workers of different ages at one point in time to obtain an estimate of 
how earnings will change for any individual. If, however, productivity improvements occur in the economy, these will tend 
to raise the earnings of individuals over time. If recent patterns of skill bias in productivity improvement continue, the 
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measured by test scores are prone to considerable measurement error. Even if the tests were measuring 
exactly the relevant skill concept, we know that there are substantial errors in each test.^^ These errors 
will in general lead to downward biases in the estimated coefficients. 

A limited number of additional studies are available for developed countries outside of the United 
States. McIntosh and Vignoles (2001) study wages in the United Kingdom and find strong returns to 
both numeracy and literacy. Finnic and Meng (2002) and Green and Riddell (2003) investigate returns 
to cognitive skills in Canada. Both suggest that literacy has a significant return, but Finnic and Meng 
(2002) find an insignificant return to numeracy. This latter finding stands at odds with most other 
analyses that have emphasized numeracy or math skills. 

Another part of the return to school quality comes through continuation in school.^"* There is 
substantial U.S. evidence that students who do better in school, either through grades or scores on 
standardized achievement tests, tend to go farther in school.'^ Mumane, Willett, Duhaldeborde, and 

impact of improvements in student skills are likely to rise over the work life instead of being constant as portrayed here (cf. 
Katz and Murphy (1992)). On the other hand, such skill-biased change has not always been the case, and technology could 
push returns in the opposite direction. 

In most testing situations, both the reliability of the specific test and the validity of the test are considered. 
Reliability relates to how well the test measures the specific material - and would include elements of specific question 
development and choice along with individual variations that would occur if an individual took the same test at different 
points in time. Validity refers to the correspondence between the desired concept (skills related to productivity or earnings 
differences) and the specific choice of test domains (such as mathematical concepts at some specific level). 

Because they look at discrete levels of skills, it is difficult to compare the quantitative magnitudes directly to the 
U.S. work. 

14 

As noted above, much of the work by economists on differences in worker skills has actually been directed at the 
issue of determining the average labor market returns to additional schooling. The argument has been that higher- ability 
students are more likely to continue in schooling. Therefore, part of the higher earnings observed for those with additional 
schooling really reflects pay for added ability and not for the additional schooling. Economists have pursued a variety of 
analytical approaches for dealing with this, including adjusting for measured cognitive test scores, but this work generally 
ignores issues of variation in school quality. The various adjustments for ability differences typically make small differences 
on the estimates of the value of schooling, and Heckman and Vytlacil (2001) argue that it is not possible to separate the 
effects of ability and schooling. The only explicit consideration of school quality typically investigates expenditure and 
resource differences across schools, but these are known to be poor measures of school quality differences (Hanushek 
(2002b)). 

See, for example, Dugan (1976); Manski and Wise (1983). Rivkin (1995) finds that variations in test scores capture 
a considerable proportion of the systematic variation in high school completion and in college continuation, so that test 
score differences can fully explain black-white differences in schooling. Bishop (1991) and Hanushek, Rivkin, and Taylor 
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Tyler (2000) separate the direct returns to measured skill from the indirect returns of more schooling 
and suggest that perhaps one-third to one-half of the full return to higher achievement comes from 
further schooling. Note also that the effect of quality improvements on school attainment incorporates 
concerns about drop out rates. Specifically, higher student achievement keeps students in school longer, 
which will lead among other things to higher graduation rates at all levels of schooling.^’’ 

Knighton and Bussiere (2006) find that higher scores at age 15 lead to significantly higher rates of 
post-secondary schooling of Canadian 19-year-olds. This finding is particularly interesting for the 
international comparisons that we consider below, because the analysis follows up on precisely the 

17 

international testing that is used in our analysis of economic growth. 

2.3 Impacts of Educational Quality on Individual Incomes — Developing Countries 

Questions remain about whether the clear impacts of quality in the U.S. generalize to other 
countries, particularly developing countries. The literature on returns to cognitive skills in developing 
countries is restricted to a relatively limited number of countries: Ghana, Kenya, Morocco, Pakistan, 
South Africa, and Tanzania. Moreover, a number of studies actually employ the same basic data, albeit 
with different analytical approaches, but come up with somewhat different results. Table 2.1 provides a 
simple summary of the quantitative estimates available for developing countries. 



(1996), in considering the factors that influence school attainment, find that individual achievement scores are highly 
correlated with continued school attendance. Neal and Johnson (1996) in part use the impact of achievement differences of 
blacks and whites on school attainment to explain racial differences in incomes. Their point estimates of the impact of 
cognitive skills (AFQT) on earnings and school attendance appear to be roughly comparable to that found in Mumane, 
Willett, Duhaldeborde, and Tyler (2000). Behrman, Kletzer, McPherson, and Schapiro (1998) find strong achievement 
effects on both continuation into college and quality of college; moreover, the effects are larger when proper account is 
taken of the various determinants of achievement. Hanushek and Pace (1995) find that college completion is significantly 
related to higher test scores at the end of high school. 

This work has not, however, investigated how achievement affects the ultimate outcomes of additional 
schooling. For example, if over time lower-achieving students tend increasingly to attend further schooling, these schools 
may be forced to offer more remedial courses, and the variation of what students know and can do at the end of school may 
expand commensurately. 

The OECD tested random samples of 15-year-old students across participating countries under the PISA program in 
2000. Students taking these tests in Canada were then followed and surveyed in 2002 and 2004. See Section 4, below. 
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Table 2.1: Summary of Estimated Returns to a Standard Deviation Increase in Cognitive Skills 



Country 


Study 


Estimated effecC 


Notes 


Ghana 


Glewwe (1996) 


0.21 **-0.3** (government) 
0.14-0.17 (private) 


Alternative estimation approaches yield some differences; math effects shown generally 
more important than reading effects, and all hold even with Raven’s test for ability. 


Ghana 


Jolliffe (1998) 


0.05-0.07* 


Household income related to average math score with relatively small variation by 
estimation approach; effect is only observed with off-farm income, and on-farm income 
is not significantly related to cognitive skills. 


Ghana 


Vijverberg (1999) 


? 


Income estimates for math and reading with nonfarm self-employment; highly variable 
estimates (including both positive and negative effects) but effects not generally 
statistically significant. 


Kenya 


Boissiere, Knight, 
and Sabot (1985); 
Knight and Sabot 
(1990) 


0.19**-0.22** 


Total sample estimates: small variation by primary and secondary school leavers. 


Morocco 


Angrist and Lavy 
(1997) 


? 


Cannot convert to standardized scores because use indexes of performance; Erench 
writing skills appear most important for earnings, but results depend on estimation 
approach. 


Pakistan 


Alderman, 
Behrman, Ross, 
and Sabot (1996) 


0.12-0.28* 


Variation by alternative approaches and by controls for ability and health; larger and 
more significant without ability and health controls. 


Pakistan 


Behrman, Ross, 
and Sabot 
(forthcoming) 


0.25 


Estimates of structural model with combined scores for cognitive skill; significant 
effects of combined math and reading scores which are instrumented by school inputs 


South Africa 


Moll (1998) 


0.34**-0.48** 


Depending on estimation method, varying impact of computation; comprehension (not 
shown) generally insignificant. 


Tanzania 


Boissiere, Knight, 
and Sabot (1985); 
Knight and Sabot 
(1990) 


0.07-0.13* 


Total sample estimates: smaller for primary than secondary school leavers. 



*significant at 0.05 level; **significant at 0.01 level. 

a. Estimates indicate proportional increase in wages from a one standard deviation increase in measured test scores. 
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The summary of the evidence in Table 2.1 permits a tentative conclusion that the returns to quality 
may be even larger in developing countries than in developed countries. This of course would be 
consistent with the range of estimates for returns to quantity of schooling (e.g,., Psacharopoulos (1994) 
and Psacharopoulos and Patrinos (2004)), which are frequently interpreted as indicating diminishing 
marginal returns to schooling. 

There are some reasons for caution in interpreting the precise magnitude of estimates. First, the 
estimates appear to be quite sensitive to the estimation methodology itself. Both within individual 
studies and across studies using the same basic data, the results are quite sensitive to the techniques 
employed in uncovering the fundamental parameter for cognitive skills.'* Second, the evidence on 
variations within developing countries is not entirely clear. For example, Jolliffe (1998) finds little 
impact of skills on farm income, while Behrman, Ross, and Sabot (forthcoming) suggest an 
equivalence across sectors at least on theoretical grounds. 

Nonetheless, the overall summary is that the available estimates of the impact of cognitive skills on 
outcomes suggest strong economic returns within developing countries. The substantial magnitude of 
the typical estimates indicates that educational quality concerns are very real for developing countries 
and that this aspect of schools simply cannot be ignored - a topic that comes up below. 

Evidence also suggests that educational quality is directly related to school attainment. In Brazil, a 
country plagued by high rates of grade repetition and ultimate school dropouts, Harbison and Hanushek 
(1992) show that higher cognitive skills in primary school lead to lower repetition rates. Further, 
Hanushek, Lavy, and Hitomi (2006) find that lower quality schools, measured by lower value-added to 
cognitive achievement, lead to higher dropout rates in Egyptian primary schools. Thus, as found for 
developed countries, the full economic impact of higher educational quality comes in part through 
greater school attainment. 

This complementarity of school quality and attainment also means that actions that actually 
improve quality of schools will yield a bonus in terms of meeting goals for attainment. Conversely, 
simply attempting to expand access and attainment, say through starting a large number of low quality 
schools, will be self-defeating to the extent that there is a direct reaction to the low quality that affects 
the actual attainment results. 



** The sensitivity to estimation approach is not always the case; see, for example, Jolliffe (1998). A critique and 
interpretation of the alternative approaches within a number of these studies can be found in Glewwe (2002). 
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2.4 Evidence from the International Adult Literacy Survey 

The preceding analyses for both developed and developing countries rely largely on panel data that 
follow individuals from school into the labor market. The alternative approach as found in the 
International Adult Literacy Survey (lALS) is to test a sample of adults and then to relate these 
measures to labor market experiences.'® An advantage of this data collection approach is that it 
provides information about the labor market experiences across a broader range of age and labor 
market experience.^" 

Consistent data on basic skills of literacy and numeracy for a representative sample of the 
population aged 15-65 were collected for a sample of countries between 1994 and 1998.^' These data 
permit direct comparisons of the relative importance of quantity and quality of schooling across 
countries, although the bias toward developed economies remains. Hanushek and Zhang (2006) 

estimate returns to school attainment and to literacy scores for the 13 countries where continuous 

22 

measures of individual earnings are available. 

Figures 2.2 and 2.3 provide the relevant summary information on the returns to skills, estimated in 
a model that jointly includes school attainment and literacy scores. As in the prior analyses, both school 
attainment and cognitive skills enter into the determination of individual incomes. With the exception 
of Poland, literacy scores have a consistent positive impact on earnings (Figure 2.2). The (unweighted) 
average of the impact of literacy scores is 0.093, only slightly less than found previously for the U.S. 



This design was subsequently repeated in 2003 with the Adult Literacy and Lifeskills Survey (ALL), but only six 
countries participated, and the data were unavailable for this study. 

This approach does also present some complications, because the individuals of different ages have both different 
adult learning experiences and different times of attending school of possibly different quality. Hanushek and Zhang (2006) 
consider these alternatives, but they do not change the qualitative results about the impact of cognitive skills that are 
presented here. 

21 

Twenty-three countries and regions participated in one of three different waves of surveys conducted in 1994, 1996 
and 1998. Country participation included some sub-country divisions (e.g., Flemish speaking Belgium and Italian speaking 
Switzerland). Not all countries provided background information and wage information. lALS provides information on the 
skills and attitudes of adults aged 16-65 in a number of different areas, and the analysis here combines the scores in a single 
measure of literacy. 

Samples include full-time workers between 26 and 65 years of age. In all the models, the dependent variable is the 
logarithm of annual earnings from employment; control variables are gender, potential experience and its square, and an 
indicator for living in rural area. 
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studies. On the other hand, the estimated impact of school attainment across the 13 countries is just 
0.049 after adjusting the Mincer returns for the literacy scores. This low estimate partly reflects the 
joint consideration of literacy scores and school attainment. The estimated return to years of schooling 
without considering literacy scores is 0.06, which is still below the more common estimates of about 
0.1. As Figure 2.3 shows, the estimates that adjust for literacy scores are below the unadjusted returns 
in every country except Poland. 



Figure 2.2: Returns to Cognitive Skills, International Adult Literacy Survey 
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Source: Hanushek and Zhang (2006). 



The U.S. is noticeably higher than other countries and the previous U.S. studies, perhaps reflecting that these 
earnings are obtained across the entire work life. The average excluding the U.S. is still 0.08. 
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Figure 2.3: Returns to School Attainment, International Adult Literacy Survey 
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Source: Hanushek and Zhang (2006). 

The literacy tests in lALS are designed to measure quite basic skills only, and yet the differences 
are strongly associated with higher earnings. These results, from a broad age spectrum across a number 
of countries, reinforce the importance of quality. 

The sample of countries for the lALS unfortunately contains just one developing country - Chile. 
Nonetheless, it is suggestive that the returns both to quantity of schooling and quality exceed those 
found across the countries with the exception of the United States. In the three transition economies 
(Czech Republic, Hungary, and Poland), the returns to school attainment are also near the top of the 
sample, but the returns to educational quality are noticeably lower - perhaps reflecting institutional 
aspects of their labor markets. 

2.5 Causality 

The focus on causality of estimated returns to school attainment has not been duplicated for the 
estimated impacts of cognitive skills. The potential problems are, however, considerably different. The 
tests, with the exception of the lALS sampling, are uniformly given at a date before the labor market 
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experiences - eliminating the possibility that higher income leads individuals to do things that raise 
their test scores.^"* Nevertheless, this fact alone does not solve all of the interpretative issues. 

For our analysis, the most important issue is the source of any test score differences. It is natural to 
believe that schools have an influence on tests, but clearly other factors also enter. The extensive 
investigations of the determinants of achievement differences indicate that parents, peers, 
neighborhoods, and other factors enter along with school factors in determining achievement (see 
Hanushek (2002b)). Thus, most importantly, it is inappropriate to interpret test scores as simply 
reflecting school quality or school policy. 

We return to a discussion of possible school policies below, but at this point it is sufficient to note 
that skills - however formed - have a systematic impact on earnings. Thus, if we can find approaches 
that increase skills reliably, the available evidence strongly indicates that individual earnings and 
productivity will also increase. 

The other side of this issue is also important, however. Using just quantity of schooling in the 
earnings analyses assumes that formal schooling is the only source of skill development. But, if a 
variety of other inputs such as families or peers is also important in the formation of human capital, 
simple years of schooling is subject to this additional source of systematic measurement error. 

2.6 Income Distribution 

One implication of the impact of cognitive skills on individual earnings is that the distribution of 
those skills in the economy will have a direct effect on the distribution of income. Cognitive skills by 
themselves do not of course determine the full distribution, because other factors such as labor market 
institutions, taxes, and the like enter. But the importance of skills is becoming increasingly evident. 

Very suggestive evidence on the impact of skills on the income distribution comes from Nickell 
(2004). Nickell, using the lALS data, considers how differences in the distribution of incomes across 
countries are affected by the distribution of skills and by institutional factors including unionization and 



The lALS sampling raises the concern that tests change with age, because of continual learning or simple age 
depreciation of skills and knowledge. An investigation of this by Hanushek and Zhang (2006) suggests that these are not 
large concerns in the estimation of the earnings functions. 

25 

There could be other things that simultaneously affect both scores and earnings, such that scores are simply a 
proxy for some other important factor. Available evidence gives no reason to suspect that such factors are important. 
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minimum wages. While union eoverage is statistieally significant, he concludes that “the bulk of the 
variation in earnings dispersion is generated by skill dispersion” (page Cl 1).^^ 

The impact of the skill distribution across countries is shown dramatically in Figure 2.4, which is 
derived from a comparison of the dispersion of wages and the dispersion of prose literacy scores (each 
measured as the ratio of the 90* to the 10* percentile). The tight pattern around the regression line 
reflects a simple correlation of 0.85 (which is not affected by including the other institutional factors). 



Figure 2.4: Inequality of Educational Quality and Earnings 
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in the International Adult Literacy Survey. 

Source: Nickell (2004). 



De Gregorio and Lee (2002) find a (somewhat weaker) positive association between inequality in years of schooling 
and income inequality. 
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Other studies have also concluded that skills have an increasing impact on the distribution of 
income (e.g., Juhn, Murphy, and Pierce (1993)). In the U.S., the distribution of incomes within 
schooling groups has been rising (Levy and Murnane (1992)), i.e., holding constant schooling 
attainment, the income distribution has become more dispersed in reflection of growing rewards to 
individual skills. 

Again, these studies do not attempt to describe the causal structure, and it would be inappropriate 
to attribute the variance in earnings simply to differences in the quantity or quality of schooling. 
Nonetheless, to the extent that these contribute to variations in cognitive skills, it is fair to conclude that 
policies aimed at improving school quality (and educational outcomes) will have direct impacts on the 
income distribution. 
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3. Quantity of Schooling and Economic Growth 

Given the microeconomic evidence of the productivity-enhancing effects of education, it seems 
natural to extend the view to the macroeconomic perspective of long-run economic growth of 
countries. From a theoretical viewpoint, there are at least three mechanisms through which education 
may affect economic growth. First, just as in the micro perspective, education increases the human 
capital inherent in the labor force, which increases labor productivity and thus transitional growth 
towards a higher equilibrium level of output (as in augmented neoclassical growth theories, cf. 
Mankiw, Romer, and Weil (1992)). Second, education may increase the innovative capacity of the 
economy, and the new knowledge on new technologies, products and processes promotes growth (as in 
theories of endogenous growth, cf., e.g., Lucas (1988); Romer (1990a); Aghion and Howitt (1998)). 
Third, education may facilitate the diffusion and transmission of knowledge needed to understand and 
process new information and to successfully implement new technologies devised by others, which 
again promotes economic growth (cf., e.g.. Nelson and Phelps (1966); Benhabib and Spiegel (2005)). 

3.1 Results of Initial Cross-country Growth Regressions 

Just as in the literature on microeconomic returns to education, the majority of the macroeconomic 
literature on economic growth that tries to test these predictions employs the quantitative measure of 
years of schooling, now averaged across the labor force.^^ Early studies used adult literacy rates (e.g., 
Azariadis and Drazen (1990); Romer (1990b)) or school enrollment ratios (e.g., Barro (1991); Mankiw, 
Romer, and Weil (1992); Levine and Renelt (1992)) as proxies for the human capital of an economy. 
These were followed by attempts to measure average years of schooling based on perpetual inventory 
methods (cf. Lau, Jamison, and Louat (1991); Nehru, Swanson, and Dubey (1995)).^^ Then, it was 
particularly the work by Barro and Lee (1993, 2001) which provided internationally comparable data 
on average years of schooling for a large sample of countries and years, based on a combination of 
census or survey data on educational attainment wherever possible, using literacy and enrollment data 
to fill gaps in the census data. 



More precisely, the most commonly used measure is average years of schooling in the working-age population, 
usually defined as the population aged 15 years and over, instead of the actual labor force. 

For a survey of measurement and specification issues from early growth accounting to current cross-country growth 
regressions, see WoBmann (2003b). 
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Using average years of schooling as the education measure implicitly assumes that a year of 
schooling delivers the same increase in knowledge and skills regardless of the education system. For 
example, a year of schooling in Papua New Guinea is assumed to create the same increase in 
productive human capital as a year of schooling in Japan. Additionally, this measure assumes that 
formal schooling is the primary (sole) source of education and, again, that variations in the quality of 
nonschool factors have a negligible effect on education outcomes. This neglect of cross-country 
differences in the quality of education is probably the major drawback of such a quantitative measure 
of schooling, and we will come back to this issue in great detail below. 

The standard method to estimate the effect of education on economic growth is to estimate cross- 
country growth regressions where countries’ average annual growth in gross domestic product (GDP) 
per capita over several decades is expressed as a function of measures of schooling and a set of other 
variables deemed to be important for economic growth. Following the classical contributions by Barro 
(1991, 1997) and Mankiw, Romer, and Weil (1992),^^ a vast early literature of cross-country growth 
regressions has tended to find a significant positive association between quantitative measures of 
schooling and economic growth (for extensive reviews of the literature, see, e.g., Topel (1999); Temple 
(2001); Krueger and Lindahl (2001); Sianesi and Van Reenen (2003)).^*^ To give an idea of the 
robustness of this association, in the recent extensive robustness analysis by Sala-i-Martin, 
Doppelhofer, and Miller (2004) of 67 explanatory variables in growth regressions on a sample of 88 
countries, primary schooling turns out to be the most robust influence factor (after an East Asian 
dummy) on growth in GDP per capita in 1960-1996. 

Because this literature has been extensively reviewed elsewhere (see references above), rather than 
going into the details of the numerous important contributions, we will focus on a few important issues 
that emerge in the literature. 



Temple and WoBmann (2006) show that the significantly positive effect of education that Mankiw, Romer, and Weil 
(1992) find does not depend on their often criticized use of an education flow measure based on enrollment rates, but can be 
replicated when using years of schooling as a measure of the level of human capital in their model. 

A closely related literature weights years of schooling by parameters from microeconomic Mincer earnings 
equations (see Section 2.1 above) to construct a measure of the stock of human capital, which is then used in growth 
accounting and development accounting exercises. These use a given macroeconomic production function together with 
parameter estimates from other research to calculate the share of cross-country differences in growth or levels of income 
which can be accounted for by cross-country differences in education (see Klenow and Rodriquez-Clare (1997) and Hall 
and Jones (1999) for examples). 
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3.2 More Recent Evidence on the Effects of Levels of and Growth in Years of Schooling 

To provide a basic representation of the association between years of schooling and economic 
growth on the most recent version of available data, we use a slightly extended version of the 
education data by Cohen and Soto (2001), representing the average years of schooling of the population 
aged 15 to 64, together with data on real GDP per capita in 1960-2000 from the latest update (version 

-lo 

6.1) of the Penn World Tables by Heston, Summers, and Aten (2002). Figure 3.1 plots the average 
annual rate of growth in GDP per capita over the 40-year period against years of schooling at the 
beginning of the period for a sample of 92 countries. Both growth and education are expressed 
conditional on the initial level of output, to account for the significant conditional convergence effect. 

The regression results depicted by Figure 3.1 imply that each year of schooling is statistically 
significantly associated with a long-run growth rate that is 0.58 percentage points higher. The positive 
association is substantially larger in the sample of non-OECD countries (at 0.56) than in the sample of 
OECD countries (at 0.26), which is in line with the pattern of larger returns to education in developing 
countries discussed above. However, after controlling for the influence of openness and the security 
of property rights, the association becomes substantially smaller and turns insignificant, and it is close 
to zero when the total fertility rate is controlled for. Thus, while there is a clear positive association 
between years of schooling and growth in the latest available data, it is also somewhat sensitive to 
model specifications. 



See Jamison, Jamison, and Hanushek (forthcoming) for details of the extension. 

As discussed below, one line of investigation has been the impact of mismeasurement of the quantity of education 
on growth. The Cohen and Soto (2001) data improve upon the original quantity data by Barro and Lee. 

Added-variable plots show the association between two variables after the influences of other control variables are 
taken out. Thus, both of the two variables are first regressed on the other controls (in this case, initial GDP). Only the 
residual of these regressions, which is the part of the variation in the two variables which cannot be accounted for by the 
controls, is used in the graph. In so doing, the graph makes sure that the depicted association between the two variables is 
not driven by the control variables. The procedure is numerically equivalent to including the other controls in a multivariate 
regression of the dependent variable (growth) on the independent variable under consideration in the graph. 

The association is somewhat lower (at 0.32), but still significant when regional dummies are added to the regression. 

Similarly, the association is larger in the sample of countries below the median of initial output than above the 

median. 
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Figure 3.1: Added-variable Plot of Growth and Years of Schooling without Test-score Controls 




Conditional years of education 
coef = .58144999, se = .09536607, t = 6.1 



Notes; Added-variable plot of a regression of the average annual rate of growth (in percent) of real GDP per capita in 1960- 
2000 on average years of schooling in 1960 and the initial level of real GDP per capita in 1960. Own calculations. 



A considerable eontroversy has emerged in the literature about whether it is the level of years of 
sehooling (as would be predieted by several models of endogenous growth) or the change in years of 
sehooling (as would be predieted in basic neoclassical frameworks) which is the more important driver 
of eeonomie growth. The early evidence, such as in Benhabib and Spiegel (1994) and discussed in 
Barro and Sala-i-Martin (2004), found a positive effect of educational levels, but not of changes in 
education. However, Temple (1999) shows that in the latter ease, an existing positive association was 
hidden by a few unrepresentative outlying observations. Considerable evidenee has also emerged that 
there was substantial measurement error in the edueation data (ef. Krueger and Lindahl (2001)), and it 
is well known that measurement error affeets results based on ehanges in variables even more than 
results based on their levels. Subsequent evidenee suggests that both levels of and ehanges in years of 
sehooling may show a positive association with growth (cf. Gemmell (1996); Topel (1999); Krueger 
and Lindahl (2001)). 

More recently, two studies have tried to overcome some problems of mismeasurement in the early 
Barro-Lee data on years of sehooling: de la Fuente and Domeneeh (2001, 2006) for the sample of 
OECD eountries and Cohen and Soto (2001) for a broader sample of eountries (and the data that we use 
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here). Both studies find robust evidence of a positive association between changes in education and 
economic growth. Even more recently, Ciccone and Papaioannou (2005) find strong support for both 
the human capital level and the human capital accumulation effects using cross-country industry-level 
data that allow them to control for country-specific and industry-specific effects. 

When we add the change in years of schooling over 1960-2000 to the specification depicted in 
Figure 3.1 and similar specifications, it never turns significant with the sole exception of the sample of 
23 OECD countries. (Because of the possibly substantial amount of mismeasurement in the education 
data, though, the results on the first-differenced data on changes in education may well suffer). 

Thus, while recent research tends to find a positive effect of schooling quantity on economic 
growth, it seems beyond the scope of current data to draw strong conclusions about the relative 
importance of different mechanisms by which schooling quantity may affect economic growth. 
However, several recent studies suggest that education is important both as an investment in human 
capital and in facilitating research and development and the diffusion of technologies. With respect to 
the relative importance of the latter two mechanisms, Vandenbussche, Aghion, and Meghir (2006) 
suggest that innovation is more important relative to imitation for countries close to the technological 
frontier. As a consequence, the composition of human capital between basic and higher education may 
be important, with initial phases of education being more important for imitation and higher education 
being more important for innovation. Vandenbussche, Aghion, and Meghir (2006) provide evidence 
from a panel of OECD countries in line with this argument, which - when applied to developing 
countries - might suggest that a focus on basic skills seems warranted for developing countries. We 
will come back to the issue of the relative importance of basic versus advanced skills in more detail in 
Section 4.4 below. 

Two more skeptical studies raise noteworthy caveats, and we will return to each of them below. 
First, Bils and Klenow (2000) raise the issue of causality, suggesting that reverse causation running 
from higher economic growth to additional education may be at least as important as the causal effect 
of education on growth in the cross-country association. We will address the issue of causality in cross- 
country regressions in Section 4.2 below. Second, one of the conclusions that Pritchett (2001, 2006) 
draws from the fragility of the evidence linking changes in education to economic growth is that it is 
important for economic growth to get other things right as well, in particular the institutional 
framework of the economy. We will address this issue in Section 4.4 below. 

The positive association between growth in education and economic growth in the OECD sample is sensitive to the 
inclusion of Korea, though. 
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4. Quality of Education and Economic Growth 

4.1 A Review of the Basic Results 

The most important caveat with the literature on education and growth reviewed in the preceding 
section, though, is that it sticks to years of schooling as its measure of education at the neglect of 
qualitative differences in ensuing knowledge. As discussed in the section on individual returns to 
education, this neglect probably misses the core of what education is all about. And this neglect seems 
even more severe in cross-country comparisons than in analyses within countries: Who would sensibly 
assume that the average student in a school in Ghana or Peru would gain the same amount of 
knowledge in any year of schooling as the average student in a school in Finland or Korea? Still, using 
the quantitative measure of years of schooling does exactly that. It seems beyond doubt that one year of 
schooling does not create the same amount of acquired knowledge regardless of the quality of the 
education system in which it takes place, but delivers different increases in skills depending on the 
efficiency of the education system, the quality of teaching, the educational infrastructure, or the 
curriculum. Thus, rather than counting how long students have sat in school, it seems crucial to focus 
on how much students have learned while in school when estimating the effect of education on 
economic growth. 

Years of schooling has a second major shortcoming. Its use implicitly assumes that all skills and 
human capital formation come from formal schooling. Yet extensive evidence on knowledge 
development and cognitive skills indicates that a variety of factors outside of school - family, peers, 
and others - have a direct and powerful influence. Ignoring these nonschool factors introduces another 
element of measurement error into the growth analyses in the same way as it did in the analysis of 
individual earnings. 

From the mid-1960s to today, international agencies have conducted many international tests of 
students’ performance in cognitive skills such as mathematics and science (see the appendix to this 
section for details). Employing a re-scaling method that makes performance at different international 
tests comparable (again, see appendix), we can use performance on these standardized tests as a proxy 
for the quality of education. Figure 4.1 presents average student performance on twelve testing 
occasions on the transformed scale which maps performance on each test to the scale of the recent 



Note that each testing occasion may combine results from several tests at different age levels and in different 
subjects, so that there are 36 separate test observations altogether; see appendix for details. 
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PISA international test. This scale has a mean of 500 and a standard deviation of 100 among the OECD 

-lo 

countries in PISA. As is obvious from the figure, the developing countries that ever participated in 
one of the tests perform dramatically lower than any country in the group of OECD countries. The 
variation in the quality of education that exists among OECD countries is already substantial, but the 
magnitude of the difference to developing countries in the average amount of learning that has taken 
place after a given number of years of schooling dwarfs any within-OECD difference. 

Over the past ten years, empirical growth research demonstrates that consideration of the quality of 
education, measured by the cognitive skills learned, alters the assessment of the role of education in the 
process of economic development dramatically. When using the data from the international student 
achievement tests through 1991 to build a measure of educational quality, Hanushek and Kimko (2000) 
- first released as Hanushek and Kim (1995) - find a statistically and economically significant positive 
effect of the quality of education on economic growth in 1960-1990 that dwarfs the association 
between quantity of education and growth. Thus, even more than in the case of education and 
individual earnings, ignoring quality differences very significantly misses the true importance of 
education for economic growth. Their estimates suggest that one country-level standard deviation 
(equivalent to 47 test-score points in PISA 2000 mathematics, on whose scale Eigure 4.1 is based) 
higher test performance would yield around one percentage point higher annual growth rates. 



To facilitate comparisons, the within-country standard deviation of PISA test scores ranges between 80 and 108 in 
mathematics in the OECD countries; the U.S. value is 98. 

The precise scaling on the transformed metric is of course subject to considerable noise, in particular for the early 
tests and for countries performing far below the international mean. The tests are usually not developed to provide reliable 
estimates of performance in the tails of the achievement distribution, which would be relevant for very poorly performing 
countries. However, the rough pattern of overall performance should not be severely affected by the re-scaling. For 
example, the average performance level of Peruvian students on PISA 2002 where no re-scaling is involved is 292 in math, 
333 in science and 327 in reading. 
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Figure 4.1: Adjusted Performance on International Student Achievement Tests 
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This estimate stems from a statistical model that relates annual growth rates of real GDP per capita 
to the measure of educational quality, years of schooling, the initial level of income, and several other 
control variables (including in different specifications the population growth rates, political measures, 
openness of the economies, and the like). Hanushek and Kimko (2000) find that adding educational 
quality to a base specification including only initial income and educational quantity boosts the 
variance in GDP per capita among the 31 countries in their sample that can be explained by the model 
from 33 to 73 percent. The effect of years of schooling is greatly reduced by including quality, leaving 
it mostly insignificant. At the same time, adding the other factors leaves the effects of quality basically 
unchanged. 

Several studies have since found very similar results. Another early contribution, by Lee and Lee 
(1995), found an effect size similar to Hanushek and Kimko (2000) using data from the 1970-71 First 
International Science Study on the participating 17 countries, also leaving quantitative measures of 
education with no significant effect on growth. Using a more encompassing set of international tests, 
Barro (2001) also finds that, while both the quantity and the quality of education matter for economic 
growth, quality is much more important. Employing the quality measure developed by Hanushek and 
Kimko (2000) in a development accounting framework, WoBmann (2002, 2003b) finds that the share 
of cross-country variation in levels of economic development attributable to international differences in 
human capital rises dramatically when quality of education is taken into account."^° 

Extensions of the measure of Hanushek and Kimko (2000) and its imputation in WoBmann 
(2003b) are also used in the cross-country growth regressions by Bosworth and Collins (2003) and in 
the cross-country industry-level analysis by Ciccone and Papaioannou (2005). Both also find that 
educational quality strongly dominates any effect of educational quantity on growth.'^' Coulombe, 
Tremblay, and Marchand (2004) and Coulombe and Tremblay (2006) use test-score data from the 
International Adult Eiteracy Survey (see Section 2.4 above) in a panel of 14 OECD countries, 
confirming the result that the quality measure outperforms quantitative measures of education. Einally, 
Jamison, Jamison, and Hanushek (forthcoming) extend the Hanushek and Kimko (2000) analysis by 



^ This work builds on Gundlach, Rudman, and WoBmann (2002) and analyzes output per worker in 132 countries in 
1990. The variation that can be attributed to international differences in human capital rises from 21 percent to 45 percent 
once the quality of education is taken into account, and to over 60 percent in samples with reasonable data quality. 

Bosworth and Collins (2003) cannot distinguish the effect of quality of education from the effect of quality of 
government institutions. The analysis in Section 4.5 below shows, however, that they can be separated when we use our 
new quality measure that also extends the country sample by several additional data points on international tests scores. 
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using the mathematics component of the transformed and extended tests shown in Figure 4.1, 
replicating and strengthening the previous results by using test data from a larger number of countries, 
controlling for a larger number of potentially confounding variables and extending the time period of 
the analysis. Using the panel structure of their growth data, they suggest that education seems to 
improve income levels mainly though speeding up technological progress, rather than shifting the level 
of the production function or increasing the impact of an additional year of schooling. 

In sum, the existing evidence suggests that the quality of education, measured by the knowledge 
that students gain as depicted in tests of cognitive skills, is substantially more important for economic 
growth than the mere quantity of education. 

4.2 Issues of Endogeneity 

As noted, growth modeling is naturally subject to a common concern: Do the identified factors 
represent truly causal influences or mere associations that will not affect growth if altered by policy? 

The concerns about causality in the relationship of cognitive skills and growth have been addressed 
in detail by Hanushek and Kimko (2000). They conclude that causation concerns are very different in 
the case of quality than quantity, being much less of an issue in interpreting the results. Because 
causality issues are important for assessing our results below, we describe the Hanushek and Kimko 
direct investigations. 

One common concern in analyses such as this is that schooling might not be the actual cause of 
growth but, in fact, may just reflect other attributes of the economy that are beneficial to growth. For 
example, the East Asian countries consistently score very highly on the international tests (see Figure 
4.1), and they also had extraordinarily high growth over the 1960-1990 period. It may be that other 
aspects of these East Asian economies have driven their growth and that the statistical analysis of labor 
force quality simply is picking out these countries. But in fact, even if the East Asian countries are 
excluded from the analysis, a strong — albeit slightly smaller — relationship is still observed with test 
performance. This test of sensitivity of the results seems to reflect a basic importance of school quality, 
a factor that contributes also to the observed growth of East Asian countries. 

Another concern is that other factors that affect growth, such as efficient market organizations, are 
also associated with efficient and productive schools — so that, again, the test measures are really a 
proxy for other attributes of the country. To investigate this, Hanushek and Kimko concentrate on 
immigrants to the United States who received their education in their home countries. They find that 
immigrants who were schooled in countries that have higher scores on the international math and 
science examinations earn more in the United States. On the other hand immigrants receiving part or all 
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of their schooling in the United States do not see any earnings advantage linked to the cognitive skills 
of their home country. This analysis makes allowance for any differences in school attainment, labor 
market experience, or being native English-language speakers. In other words, skill differences as 
measured by the international tests are clearly rewarded in the United States labor market, reinforcing 
the validity of the tests as a measure of individual skills and productivity. 

Finally, the observed relationships could simply reflect reverse causality, that is, that countries that 
are growing rapidly have the added resources necessary to improve their schools and that better student 
performance is the result of growth, not the cause of growth. This issue is potentially important when 
policy alternatives are considered below, but, as a simple test of this, Hanushek and Kimko investigated 
whether the international math and science test scores were systematically related to the resources 
devoted to the schools in the years prior to the tests. They were not. If anything, their results suggested 
relatively better performance in those countries spending less on their schools. 

One final issue warrants consideration: The United States has never done well on these 
international assessments, yet its growth rate has been very high for a long period of time. The 
reconciliation is that quality of the labor force is just one aspect of the economy that enters into the 
determination of growth. A variety of factors clearly contribute, and these factors work to overcome 
any deficits in quality. These other factors may also be necessary for growth. In other words, simply 
providing more or higher-quality schooling may yield little in the way of economic growth in the 
absence of other elements, such as the appropriate market, legal, and governmental institutions to 
support a functioning modern economy. Past experiences investing in less developed countries that lack 
these institutional features suggest that schooling is not necessarily itself a sufficient engine of growth 
(e.g.. Easterly (2002); Pritchett (2001)). 

We take up these issues about quality of societal institutions in detail below, because the evidence 
suggests that they are very important and could potentially distort the analysis of the impacts of human 
capital. Here, however, we briefly consider the U.S. situation, both because it sets the stage for more 
recent analyses and because it helps to provide some balance to the overall picture of growth. 

Three other factors immediately come to mind as being important in U.S. growth and as potentially 
masking to detrimental effects of low school quality. First, almost certainly the most important factor 
sustaining the growth of the U.S. economy is the openness and fluidity of its markets. The United 
States maintains generally freer labor and product markets than most countries in the world. The 
government generally has less regulation on firms, and trade unions are less extensive than those in 
many other countries. Even broader, the United States has less intrusion of government in the operation 
of the economy, including lower tax rates and minimal government production through nationalized 
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industries. These factors encourage investment, permit the rapid development of new products and 
activities by firms, and allow U.S. workers to adjust to new opportunities. While identifying the precise 
importance of these factors is difficult, a variety of analyses suggest that such market differences could 

42 

be very important explanations for differences in growth rates. 

Second, over the twentieth century, the expansion of the education system in the United States 
outpaced that around the world. The United States pushed to open secondary schools to all citizens. 
With this came also a move to expand higher education with the development of land grant 
universities, the G.I. bill, and direct grants and loans to students. More schooling with less learning 
each year still yielded more human capital than found in other nations that have less schooling but 
more learning in each of those years."^^ 

Third, the analysis of growth rates across countries emphasizes quality of the elementary and 
secondary schools of the United States. It does not include any measures of the quality of U.S. colleges. 
By most evaluations, U.S. colleges and universities rank at the very top in the world. A number of 
models of economic growth in fact emphasize the importance of scientists and engineers as a key 
ingredient to growth. By these views, the technically trained college students who contribute to 
invention and to development of new products provide a special element to the growth equation. Here, 
again, the United States appears to have the best programs. 

Finally, in terms of endogeneity, it is important to note that our current work (reported below) 
replicates two of the tests in Hanushek and Kimko (2000). Specifically, in the context of extended 
growth analyses, we consider potential biases from including the East Asian countries, and we consider 
the resources-skills relationship. 

4.3 Some New Evidence 

To provide the most up-to-date evidence, we extend the existing evidence in several ways. The 
new evidence adds additional international student achievement tests not previously available and uses 
the most recent data on economic growth which allow an analysis for an even longer time period 
(1960-2000). Furthermore, the new data extend the sample of countries with available test-score and 
growth information from 31 countries in Hanushek and Kimko (2000) to 50 countries (see the appendix 
of this section for details on the data). In the subsequent sections, we will also use these data to analyze 



See, for example, Krueger (1974); World Bank (1993); Parente and Prescott (1994, 1999). 

This advantage has clearly ended as many OECD countries have expanded schools to exceed the quantity of 
schooling found in the United States (see Organisation for Economic Co-operation and Development (2003)). 
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effects of the distribution of educational quality at the bottom and at the top on economic growth, as 
well as interactions between educational quality and the institutional infrastructure of an economy. 

Our measure of the quality of education is a simple average of the mathematics and science scores 
over all the international tests depicted in Figure 4.1. We interpret this as a proxy for the average 
educational performance of the whole labor force. This measure encompasses overall cognitive skills, 
not just those developed in schools. Thus, whether skills are developed at home, in schools, or 
elsewhere, they are included in the growth analyses. As in the analyses underlying Figure 3.1, the 
source of the income data is version 6.1 of the Penn World Tables (cf. Heston, Summers, and Aten 
(2002)), and the data on years of schooling is an extended version of the Cohen and Soto (2001) data. 

The basic result is reported in column (2) of Table 4.1 and depicted graphically in Figure 4.2.^^ 
After controlling for the initial level of GDP per capita and for years of schooling, the test-score 
measure features a statistically significant effect on the growth in real GDP per capita in 1960-2000. 
According to this specification, test scores that are larger by one standard deviation (measured at the 
student level across all OECD countries in PISA) are associated with an average annual growth rate in 
GDP per capita that is two percentage points higher over the whole 40-year period. Below we relate 
these numbers to outcomes of school reform policies. 

When educational quality is added to a model that just includes initial income and years of 
schooling (column (1) of Table 4.1), the share of variation in economic growth explained by the model 
(the adjusted R^) jumps from 0.25 to 0.73. As in Figure 3.1, quantity of education is statistically 
significantly related to economic growth in a specification that neglects educational quality, but the 
association between years of schooling and growth turns insignificant and is reduced to close to zero 
once the quality of education is included in the model (see the bottom of Figure 4.2)."^^ In either 
specification, there is evidence for conditional convergence in the sense that countries with higher 
initial income tend to grow more slowly over the subsequent period. 



^ For a brief technical description of added-variable plots see Section 3.2 above. 

The result remains the same when the measure of years of schooling refers to the average between 1960 and 2000, 
rather than the initial 1960 value. The coefficient estimate on years of schooling in the 50-country sample of column (1) of 
Table 4.1 is somewhat smaller than the one reported for the 92-country sample reported in Figure 3.1, corresponding to the 
fact discussed above that the estimate tends to be smaller in high-income countries. 
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Table 4.1: Education as Determinant of Growth of Income per Capita, 1960-2000 





(1) 


(2) 


(3)“ 


(4) 


GDP per capita 1960 


-0.379 


-0.302 


-0.277 


-0.351 




(4.24) 


(5.54) 


(4.43) 


(6.01) 


Years of schooling 1960 


0.369 


0.026 


0.052 


0.004 




(3.23) 


(0.34) 


(0.64) 


(0.05) 


Test score (mean) 




1.980 


1.548 


1.265 






(9.12) 


(4.96) 


(4.06) 


Openness 








0.508 










(1.39) 


Protection against expropriation 








0.388 










(2.29) 


Constant 


2.785 


-4.737 


-3.701 


-4.695 




(7.41) 


(5.54) 


(3.32) 


(5.09) 


N 

R" (adj.) 


50 

0.252 


50 

0.728 


50 

0.741 


47 

0.784 



Dependent variable; average annual growth rate in GDP per capita, 1960-2000. f-statistics in parentheses. “ Regression 
includes five regional dummies. 
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Figure 4.2: Added-variable Plots of Growth and Education 
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The same pattern of results is preserved when we ignore any variation between world regions by 
including five regional dummies (column (3) of Table 4.1).^^ That is, even when considering just the 
variation that exists within each region, educational quality is significantly related to economic growth, 
with a coefficient estimate reduced from 1.98 to 1.55. 

One of the most important fundamental determinants of economic growth discussed in the recent 
literature is the institutional framework of the economy (see Section 4.5 below for details). The most 
common and powerful measures of the institutional framework used in empirical work are the openness 
of the economy to international trade and the security of property rights. When we add these two 
institutional variables to our model, they are jointly highly significant (column (4) of Table 4.1). But 
the positive effect of educational quality on economic growth is very robust to the inclusion of these 
controls, albeit slightly reduced in magnitude to 1.26. 

Other potential determinants of economic growth often discussed in the literature are fertility and 
geography. However, when we add the total fertility rate and common geographical proxies, such as 
latitude or the fraction of the land area of a country that is located within the geographic tropics, to the 
specification reported in column (4) of Table 4.1, neither of these additional variables is statistically 
significantly associated with economic growth. Furthermore, none of the other results, in particular for 
educational quality, are qualitatively affected. 

An important issue is whether the role of educational quality in the process of economic 
development differs between developing and developed countries. In Table 4.2, we divide the sample 
of countries into two groups in two different ways. First, in columns (5) and (6), we subdivide the 
sample into OECD countries and non-OECD countries. Results are remarkably similar, with the point 
estimate of the effect of educational quality slightly larger in non-OECD countries. However, the effect 
of educational quality on economic growth does not differ significantly between the two groups of 



The regions are East Asia, South Asia, Latin America, Middle East and North Africa, Sub-Saharan Africa, and the 
industrial countries. 

The proxy for openness used here is the fraction of years between 1960 and 1998 that a country was classified as 
having an economy open to international trade, based on five factors including tariffs, quotas, exchange rate controls, export 
controls, and whether or not a socialist economy (cf. Sachs and Warner (1995)). The proxy for security of property rights is 
an index of the protection against expropriation risk, averaged over 1985-1995, from Political Risk Services (a private 
company which assesses the risk that investments will be expropriated in different countries), ranging from 0 to 10 (high 
figures corresponding to low risk), as used by Acemoglu, Johnson, and Robinson (2001) and provided in McArthur and 
Sachs (2001). This measure of the risk of confiscation and forced nationalization of property is often used as a summary 
variable for institutional quality, and similar data were first used in this framework by Knack and Keefer (1995). 
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countries. The results remain qualitatively the same when openness and quality of institutions are again 
added as control variables. 



Table 4.2: Education as Determinant of Growth of Income per Capita, 1960-2000: Sub-samples 





(5) 

Developing 

countries® 


(6) 

OECD sample 


(7) 

Low-income 

countries'’ 


(8) 

High-income 

countries® 


GDP per capita 1960 


-0.262 


-0.301 


-0.063 


-0.294 




(1.77) 


(5.81) 


(0.28) 


(6.38) 


Years of schooling 1960 


0.025 


0.025 


0.006 


0.152 




(0.20) 


(0.26) 


(0.05) 


(1.70) 


Test score (mean) 


2.056 


1.736 


2.286 


1.287 




(6.10) 


(4.17) 


(6.98) 


(5.37) 


Constant 


-5.139 


-3.539 


-6.412 


-2.489 




(3.63) 


(1.96) 


(4.52) 


(2.86) 


N 

R" (adj.) 


27 

0.676 


23 

0.830 


25 

0.707 


25 

0.783 



Dependent variable: average annual growth rate in GDP per capita, 1960-2000. “ Non-OECD countries. Countries below 
sample median of GDP per capital 1960. Countries above sample median of GDP per capital 1960. 



Second, in columns (7) and (8), we subdivide the sample into countries above and below the 
sample median of initial GDP per capita. Again, the significant effect of educational quality remains 
robust in both sub-samples. Here, the effect of quality is considerably larger in the low-income 
countries, and the difference in the coefficients on test scores is statistically significant at the five 
percent level. Thus, if anything, the effect of educational quality is larger in developing countries than 
in developed countries. Furthermore, the robustness of the effect in the sub-samples is remarkable, in 
particular considering the small sample sizes of the specifications reported in Table 4.2. 

When adding an interaction term between years of schooling and test scores to the specification, 
the interaction term is insignificant in the full sample of countries. But in the sub-sample of low- 
income countries (with a GDP per capita below the sample median in 1960), the interaction term is 
significantly positive. At least in this sample, the returns to increased years of schooling increase with 
the quality of the education. Once there is a high-quality school system, it pays to keep children longer 
in school - but it does not if the school system does not produce skills. 



Similarly, the effect of educational quality remains robust in the two sub-samples of countries above and below the 
median of economic growth over the considered period, with the point estimate larger in the high-growth sample. 
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Importantly, the results do not appear to be an artifact of the specific time period, set of countries, 
or achievement measurement decisions. Extensive robustness tests confirm the strong and significant 
relationship between educational quality and economic growth. 

The results reported in Tables 4.1 and 4.2 are robust to several alternative approaches to measuring 
the quality of education. The results remain qualitatively the same when using only the tests performed 
at the level of lower secondary education, excluding any test in primary schooling or in the final year of 
secondary education. Arguably, test scores at the end of the secondary level, which combine the 
knowledge accumulated over primary and secondary schooling, may be more relevant for the human 
capital of the labor force than test scores that capture only the knowledge at the end of either primary or 
lower secondary school. At the same time, the duration of secondary education differs across countries, 
so that tests performed in the final year of secondary schooling in each country may not be as readily 
comparable across countries as the grade-based target populations. Further, given differing school 
completion rates, the test for the final year of secondary schooling may imply cross-country samples 
with differential selectivity of test takers. Neither the primary-school tests nor the tests in the final 
secondary year are crucial for the results. 

Furthermore, results are qualitatively the same when using only scores on tests performed since 
1995. These recent tests have not been used in the previous analyses and are generally viewed as 
having the highest standard of sampling and quality control. Fikewise, results are robust to using tests 
scores since 1995 for just lower secondary grades. 

A drawback of using only the more recent tests is that such an approach requires a strong version 
of the assumption that test performance is reasonably constant over time, because it relates test 
performance measured since 1995 to the economic data for 1960-2000. To make sure that higher 
previous economic growth is not driving the measured test performance, we also used a test score 
measure that disregards all tests since the late 1990s. Our results turn out to be robust, with a point 
estimate on the test score variable that is significantly higher, when we restrict the tests to only those 
conducted until 1995 (sample reduced to 34 countries) and until 1984 (22 countries)."^^ In sum, the 
results are not driven by either early or late test scores alone. 

The results are also robust to performing the analyses in two sub-periods, 1960-1980 and 1980- 
2000. The most recent period includes the Asian currency crisis and other major economic disruptions 



We can also use the average early test scores (either until 1984 or until 1995) as an instrument for the average of all 
test scores in a two-stage least-squares regression, in order to utilize only that part of the total test-score measure that can be 
traced back to the early test scores. Again, our results are robust to such an instrumental-variable approach. 
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which could affect the apparent impact of educational quality on growth - but they do not. Test scores 
exert a positive effect on growth in both sub-periods, while years of schooling remain insignificant in 
both sub-periods. In addition, all results remain qualitatively the same when we use test scores in math 
and science separately. 

Finally, as discussed in the previous section, one may worry about the extent to which East Asian 
countries are driving the association between educational quality and economic growth. As is obvious 
from Figure 4.2, several East Asian countries feature both high educational quality and high economic 
growth, and the top right corner of the figure is noticeably inhabited by East Asian countries. Still, the 
regression reported above that controls for regional dummies already showed that the association 
between educational quality and growth is not solely due to a difference between the group of East 
Asian countries and the rest of the countries. Furthermore, similar to the results reported in Hanushek 
and Kimko (2000), when we drop all ten East Asian countries from our sample of 50 countries, the 
estimate on educational quality remains statistically highly significant at a point estimate of 1.3. The 
significant effect in the sample without East Asian countries is also evident in the two separate sub- 
periods, with the point estimates being larger in the separate regressions. 

4.4 Distribution of Educational Quality and Economic Growth 

The results so far consider only the mean of the quality of education in a country. From a policy 
viewpoint, though, it is important to know whether different parts of the distribution of education affect 
growth differently. Eoosely speaking, is it a few “rocket scientists” at the very top of the distribution 
who are needed to spur economic growth, or is it “education for all” that is needed to lay a broad base 
at the lower parts of the educational distribution? Does educational performance at different points in 
the distribution of the population have separate effects on economic growth? 

To estimate such effects, we measure the share of students in each country that reaches a certain 
threshold of basic literacy at the international scale, as well as the share of students that surpasses an 
international threshold of top performance. As the two thresholds, we use 400 and 600 test-score 
points, respectively, on the transformed international scale as shown in Figure 4.1. 

The threshold of 400 points is meant to capture a notion of basic literacy. On the PISA 2003 math 
test, for example, this would correspond to the middle of the level 1 range which denotes that students 
can answer questions involving familiar contexts where all relevant information is present and the 
questions are clearly defined.^** While the PISA 2003 science test does not define a full set of 

Level 1 in PISA 2003 math was defined by students scoring from 358 to 420. 
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proficiency levels, the threshold of 400 points is used as the lowest bound for a basic level of science 
literacy (Organisation for Economic Co-operation and Development (2004), p. 292). In general, a level 
of 400 points means performance at one standard deviation below the OECD mean. The share of 
students achieving this level ranges from 18% in Peru to 97% in the Netherlands and Japan, with an 
international median of 86% in our sample. The threshold of 600 points captures the notion of very 
high performers, performing at more than one standard deviation above the OECD mean. The share of 
students achieving this level ranges from below 0.1% in Colombia and Morocco to 18% in Singapore 
and Korea and 22% in Taiwan, with an international median of 5% in our sample. 

When we enter the share of students above the two thresholds jointly in our growth model (see 
column (9) of Table 4.3), both turn out to be separately significantly related to economic growth. That 
is, both education for all and the share of absolutely top performers seem to exert separately 
identifiable effects on economic growth. These initial results should be viewed as suggestive rather 
than definite, not the least because of issues of multicollinearity between the two measures of 
educational quality which have a bivariate correlation of 0.73. Importantly, the relative size of the 
effects of performance at the bottom and at the top of the distribution depends on the specific 
specification used, and further research is needed to yield more detailed predictions. Nonetheless, the 
evidence strongly suggests that both dimensions of educational performance count for the growth 
potential of an economy (see Hanushek and WoBmann (in process) for greater detail). 

Additional specifications using different points of the distribution of test scores support this 
general view. The positive effect of educational quality is highly robust to measuring test-score 
performance at different percentiles of the distribution, such as the 25*, 75*, and 95* percentile, rather 
than at the mean. When including two measures at a time, they are always jointly, and often separately, 
significant. When including two percentile measures, it is not always clear whether the higher ones 
tend to dominate the lower ones or vice versa in different specifications, which are again hampered by 
strong multicollinearity of the test-score measures. The highest bivariate correlation among the 
different test-score measures is between mean performance and the share of students in a country 
reaching the threshold of 400 test-score points, at 0.99. Thus, mean performance seems to capture 
mainly the pattern of basic literacy across the countries in our sample. The specifications including two 



In contrast to the result of Caste 116 and Domenech (2002) based on years of schooling, we do not find a significant 
effect of inequality of educational quality per se on economic growth, but rather significant effects at different points of the 
distribution. 
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test-score variables remain qualitatively the same when including five regional dummies, and the 
pattern of significance also holds when the other control variables are included. 



Table 4.3: Educational Quality and Growth: Distribution and Institutional Interaction 





(9) 


(10) 


GDP per capita 1960 


-0.287 


-0.297 




(5.12) 


(5.64) 


Years of schooling 1960 


0.022 


-0.031 




(0.28) 


(0.41) 


Share of students above threshold of 400 


2.732 






(3.61) 




Share of students above threshold of 600 


12.880 






(4.35) 




Test score (mean) 




0.942 






(2.30) 


Openness 




0.732 






(2.13) 


Test score * openness 




1.609 






(2.34) 


Constant 


1.335 


3.814 




(2.97) 


(11.24) 


N 

R" (adj.) 


50 

0.719 


47 

0.785 



Dependent variable: average annual growth rate in GDP per capita, 1960-2000. f-statistics in parentheses. 



We can also divide our combined test-score measure into one using only the math tests and one 
using only the science tests. Both subject- specific test scores are significantly associated with growth 
when entered separately or jointly. There is some tendency for math performance to dominate science 
performance in different robustness specifications but overall performance in math and science carry 
separate weight for economic growth. 

In sum, different dimensions of the quality of education seem to have independent positive effects 
on economic growth. This is true both for basic and top dimensions of educational performance and for 
the math and science dimensions of performance. Because of the thin country samples, however, we 
trust the pattern of results more that the specific estimates. 

4.5 Institutions, Education and Growth 

In recent years, there has been an increasing emphasis on the role of economic institutions as the 
fundamental cause of differences in economic development (cf. Acemoglu, Johnson, and Robinson 
(2001, 2002, 2005)). As we saw in column (4) of Table 4.1, the quality of institutions as measured by 
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the protection against expropriation is indeed significantly related to economic growth in our model. A 
second measure of institutional quality, openness to international trade, also tends to be significantly 
related to economic growth, at least jointly with protection against expropriation. (Note that protection 
against expropriation and openness are strongly correlated, with a correlation coefficient of 0.71.) At 
the same time, though, the results show that, on average, the quality of education exerts a positive 

CO 

effect on economic growth independent of these measures of the quality of institutions. 

Still, the effect of educational quality on economic growth may differ depending on the economic 
institutions of a country. North (1990), for example, emphasizes that the institutional framework plays 
an important role in shaping the relative profitability of piracy versus productive activity. If the 
available knowledge and skills are used in the former rather than the latter activity, one may certainly 
expect the effect on economic growth to be substantially different, and maybe even to turn negative. 
Similarly, Murphy, Shleifer, and Vishny (1991) show that the allocation of talent between rent-seeking 
and entrepreneurship matters for economic growth: countries with relatively more engineering college 
majors grow faster and countries with relatively more law concentrators grow more slowly. Easterly 
(2002) argues that education may not have much impact in less developed countries that lack other 
facilitating factors such as functioning institutions for markets and legal systems. In a similar way, 
Pritchett (2001, 2006) suggests that due to deficiencies in the institutional environment, cognitive skills 
might have been applied to socially unproductive activities in many developing countries, rendering the 
average effect of education on growth across all countries negligible. While his result of negligible 
average growth effects of increases in educational quantity may have more to do with issues of 
measuring education than with perverse economic institutions, his point that the social returns to 
education may be low in countries with perverse institutional environments is certainly worth pursuing. 

Thus, in column (10) of Table 4.3, we add an interaction term between one of our institutional 

CO 

measures, openness to international trade, to our growth specification. The results suggest that 
openness and educational quality not only have significant individual effects on economic growth but 
also a significant positive interaction. This result is depicted in Figure 4.3. The effect of educational 
quality on economic growth is indeed significantly higher in countries that have been fully open to 



While Acemoglu, Johnson, and Robinson (2001, 2002, 2005) find that geographical features do not exert an effect 
on economic growth independent of institutions, this remains a disputed subject (cf., e.g., McArthur and Sachs (2001)). 
When we add proxies for geography such as location in the geographic tropics or latitude to our basic specification, these do 
not enter significantly and do not change the findings on institutions and on educational quality. 

To facilitate interpretation, the test-score variable is centered in this specification. 
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international trade than in countries that have been fully closed.^"^ The effect of educational quality on 
economic growth is significantly positive, albeit relatively low at 0.9, in closed economies, and it 
increases to a size of 2.5 in open economies. The reported result is robust to including the measure of 
protection against expropriation. When using protection against expropriation rather than openness to 
trade as the measure of quality of institutions, there is similarly a positive interaction term with 
educational quality, although it lacks statistical significance. 



Figure 4.3: The Effect of Educational Quality on Growth Depending on Openness 

Effect of test score on growth 




Notes: Estimated effect of average achievement test scores on the average annual rate of growth of real GDP per capita in 
1960-2000, depending on the degree of openness to international trade of a country. Author calculations; see Table 4.3, 
column (10). 



Jamison, Jamison, and Hanushek (forthcoming) find a similar result, in that the impact of cognitive skills on 
technical progress is strong in countries with open trade regimes and essentially zero in closed economies. 

There is a similar positive interaction effect between test scores and openness when openness is specified as a 
dummy for countries that have been closed for the majority of the time (openness below 0.3). 
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In sum, both the quality of the institutional environment and the quality of education seem 
important for economic development. Furthermore, the effect of educational quality on economic 
growth seems to be significantly larger in countries with a productive institutional framework, so that 
good institutional quality and good educational quality can reinforce each other in advancing economic 
development. Thus, the macroeconomic effect of education depends on other complementary growth- 
enhancing policies and institutions. Nonetheless, we still also find a significant positive growth effect 
of cognitive skills even in countries with a poor institutional environment. 

4.6 The Implications of Improved Quality 

It is important to understand the implications of policies designed to improve educational 
outcomes. The previous estimation provides information about the long run economic implications of 
improvements in educational quality. To understand better the impact of improved achievement, it is 
useful to relate policy reforms directly to the pattern of economic outcomes that are consistent with 
feasible improvements. 

Two aspects of any educational reform plan are important. First, what is the magnitude of the 
reform that is accomplished? Second, how fast does any reform achieve its results? While we return 
later to discuss the efficacy of a variety of potential schooling reforms, here we simply investigate the 
economic results that might be anticipated on the assumption that some set of schooling reforms 
actually leads to substantial achievement gains over an identified time period. 

Our analysis puts the economic implications in terms of standard deviations of test scores. To 
provide a benchmark, we consider a reform that yields a 0.5 standard deviation improvement in 
average achievement of school completers. This metric is hard to understand intuitively, in part because 
most people have experiences within a single country. It is possible, however, to put this in the context 
of the previous estimation. Consider a developing country with average performance at roughly 400 
test-score points, what we described as approximately minimal literacy on these tests. For example, on 
the PISA 2003 examinations, average achievement in Brazil, Indonesia, Mexico, and Thailand fell 
close to this level. An aggressive reform plan would be to close half the gap with the average OECD 
student, which would relate to a half standard deviation improvement.^^ 



As an alternative policy change, consider what it would mean if a country currently performing near the mean of 
OECD countries in PISA at 500 test-score points (for example, Norway or the United States in PISA 2000 or Germany in 
PISA 2003, see Figure 4.1) would manage to increase its educational quality to the level of top performers in PISA at 
roughly 540 test-score points (for example, Finland or Korea on either PISA test). Such an increase of 40 test-score points 
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The timing of this reform is also important. Two aspects of timing enter. First, such movement of 
student performance cannot be achieved instantaneously but requires changes in schools that will be 
accomplished over some time (say, through systematic replacement of teachers through retirements and 
subsequent hiring). The time path of any reform is difficult to specify, but achieving the change of 0.5 

c-j 

standard deviations described above for an entire nation may realistically take 20 to 30 years. 

Second, if the reforms succeed, their impact on the economy will not be immediate. In particular, 
the new graduates from school will initially be a small to negligible portion of the labor force. It will be 
sometime after the reform of the schools before the impact on the economy is realized. In other words, 
the prior estimates are best thought of as the long run, or equilibrium, outcomes of a labor force with a 
given educational quality. 

To show the impact of these elements of reform. Figure 4.4 simulates the impact on the economy 
of reform policies taking 10, 20, or 30 years for a 0.5 standard deviation improvement in student 
outcomes at the end of upper secondary schooling - what we label as a “moderately strong knowledge 
improvement.” For the calibration, policies are assumed to begin in 2005 - so that a 20-year reform 
would be complete in 2025. The actual reform policy is presumed to operate linearly such that, for 
example, a 20-year reform that ultimately yielded Vi standard deviation higher achievement would see 
the performance of graduates increasing by 0.025 standard deviations each year over the period. It is 
also necessary to characterize the impact on the economy, which we assume is proportional to the 

CO 

average achievement levels of prime age workers. Finally, for this exercise we project the growth 



amounts to 0.4 standard deviation. These calculations also point out the asymmetry of the test score distribution with a 
considerably longer left tail of the country distribution. 

Rough calculations of the relationship between changes in teacher quality and results are found in Hanushek (2005). 
These are based on assumptions of teacher turnover that mirror the U.S. experience. 

We specifically assume that the relevant achievement levels depend on workers in the first 35 years of their work 
life and calculate the average achievement levels based upon the progressive improvements of school completers under the 
different reform periods. The full impact of an educational reform is not felt until 35 years past completion of the reform, 
e.g., for 45 years with a 10-year reform period. To our knowledge, no past work has indicated how achievement impacts 
would be felt on the economy. Our linear impact that depends on the workforce average is probably conservative, because 
new technologies that rely on better trained workers could probably be introduced once there is a substantial number of 
more skilled workers available instead of relying on improvement throughout the workforce. Alternatively, improvements 
that depend on innovation may depend most on the stock of young, very highly skilled workers, and the impact of 
educational improvements might be quite quick. 
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impact according to the basic achievement model that also includes the independent impact of 
economic institutions - column 4 of Table 4.1. 



Figure 4.4: Improved GDP with Moderately Strong Knowledge Improvement (0.5 s.d.) 




year 

“ “ 10-year reform 20-year reform 30-year reform ““ typical education spending 



The figure indicates how much larger the level of GDP is at any point after the reform policy is 
begun as compared to that with no reform. In other words, the estimates suggest the increase in GDP 
expected over and above any growth resulting from other factors. 

Obviously, for any magnitude of achievement improvement, a faster reform will have larger 
impacts on the economy, simply because the better workers become a dominant part of the workforce 
sooner. But, the figure shows that even a 20- or 30-year reform plan has a powerful impact on GDP. 
For example, a 20-year plan would yield a GDP that was five percent greater in 2037 (compared with 
an economy with no increase in educational quality). The figure also plots 3.5 percent of GDP, an 
aggressive spending level for education in many countries of the world. Five percent of GDP is 
significantly greater than the typical country’s spending on all primary and secondary schooling, so that 
it is truly a significant change that would permit the growth dividend to more than cover all of primary 
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and secondary school spending. But even a 30-year reform program (that would not be fully 
accomplished until 2035) would yield more than five percent higher real GDP by 2041. 

Projecting these net gains from school quality further past the reform period shows vividly the long 
run impacts of reform. Over a 75 year horizon, a 20-year reform yields a real GDP that is 36 percent 
higher than would be with no change in educational quality. 

It must nonetheless be clear that these effects represent the result from actual gains in educational 
outcomes. We return to this below, but there have been many attempts around the world to improve 
student outcomes, and many of these have failed to yield actual gains in student performance. Bad 
reforms, those without impacts on students, will not have these impacts. 

This simulation shows that the previous estimates of impacts of educational quality on growth have 
really large impacts on national economies. At the same time, while the rewards are large, they also 
imply that policies must be considered across lengthy time periods and require patience - a patience 
that is not always clear in national policy making. We return below to some indications of what kind of 
speed of educational improvement may be feasible. 

These reforms must also be put in a broader perspective, because other kinds of institutional 
changes and investments will also take substantial time. Changing basic economic institutions, for 
example, can seldom be accomplished over night, and their impacts will take time before the economy 
adjusts. Thus, in large part, our calculations about the time path of benefits differ more in the fact that 
we make it explicit than in its pattern compared to a variety of other economic policies. 
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Appendix: Data on Quality of Education 

The analysis in this paper relies upon a variety of cognitive achievement tests. The central analysis 
of growth and macroeconomic performance uses information from the international tests of a set of 
nations voluntarily participating in a cooperative venture under the International Association for the 
Evaluation of Educational Achievement (lEA) and from the Organisation for Economic Co-operation 
and Development (OECD). The most recent of the lEA tests is the Trends in International Mathematics 
and Science Study (TIMSS) for 2003, although there is a much longer history identified below. The 
OECD tests, called the Programme for International Student Assessment (PISA), began in 2000 and 
cover all OECD countries plus others. 

These tests have different groups of countries, sampling of students, and perspectives on what 
should be tests (see Neidorf, Binkley, Gattis, and Nohara (2006)). Our approach is to aggregate across 
the variety of tests for each country in order to develop a composite measure of performance. Perhaps 
the most important issue is whether or not the tests are measuring a common dimension of cognitive 
skills. The TIMSS tests of math and science are developed by an international panel but are related to 
common elements of primary and secondary school curriculum, while the PISA tests are designed to be 
assessments of more applied ideas. In our development of a common metric we also employ data from 
the U.S. National Assessment of Educational Progress (NAEP). That test, which is conceptually closest 
to the TIMSS tests except that it relates more directly to U.S. curriculum, provides information over 
time on a consistent basis. 

Part of the analysis on individual returns relied on the International Adult Eiteracy Survey (lAES), 
a set of tests given to 20 countries between 1994-1998. These tests cover several functional areas 
including: Prose Eiteracy - the knowledge and skills needed to understand and use information from 
texts including editorials, news stories, poems, and fiction; Document Eiteracy - the knowledge and 
skills required to locate and use information contained in various formats, including job applications, 
payroll forms, transportation schedules, maps, tables, and graphics; and Quantitative Eiteracy - the 
knowledge and skills required to apply arithmetic operations, either alone or sequentially, to numbers 
embedded in printed materials, such as balancing a checkbook, calculating a tip, completing an order 
form, or determining the amount of interest on a loan from an advertisement. They were designed to be 
very practical. 

Interestingly, the TIMSS tests with their curricular focus and the PISA tests with their real-world 
application focus are highly correlated at the country level. Eor example, the correlation coefficients 
between the TIMSS 2003 tests of 8* graders and PISA 2003 tests of I5-year-olds across the 19 
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countries participating in both are 0.87 in math and 0.97 in science, and they are 0.86 in both math and 
science across the 21 countries participating both in the TIMSS 1999 tests and the PISA 2000/02 tests. 
Similarly, there is a high correlation at the country level between the curriculum based student tests of 
TIMSS and the practical literacy adult examinations of lALS (Hanushek and Zhang (2006)). Tests with 
very different foci and perspectives tend, nonetheless, to be highly related, lending support to our 
approach of aggregating different tests for each country. 

The general idea behind our approach to aggregation is that of empirical calibration. We rely upon 
information about the overall distribution of scores on each test to compare national responses. This 
contrasts with the psychometric approach to scaling that calls for calibrating tests through use of 
common elements on each test. In reality, each of the testing situations described below is a separate 
activity with no general attempt to provide common scaling. 

The strength of our approach is that different tests across a common subject matter tend to be 
highly correlated at both the individual and aggregate level. Thus, the distributional information that we 
use is closely related to variations in individual performance levels. 

As shown in Table A4.1, there are data from international student achievement tests on twelve 
international testing occasions. Containing separate tests in different subjects and at different age 
groups, these testing occasions yield 36 separate test observations altogether, each with between 11 and 
45 participating countries with internationally comparable performance data. Most of the tests were 
conducted by the lEA, with the exception of the OECD-conducted PISA tests. 

In order to make performance on the different international tests comparable, Hanushek and 
WoBmann (in process) develop a common metric to adjust both the level of test performance and the 
variation of test performance through two data transformations. Eirst, because the United States has 
both participated in all of the international tests and has maintained its own longitudinal testing (the 
NAEP), Hanushek and WoBmann (in process) calibrate the U.S. international performance over time to 
the external standard - thus benchmarking each of the separate international tests to a comparable level. 
Second, while this provides a relative comparison of countries taking each test over time, it is also 
necessary to establish the variance on the tests so that direct compatibility of countries taking different 



In this study, we do not include the two tests conducted by the International Assessment of Educational Progress 
(lAEP) in 1988 and 1991, because they used the U.S. NAEP test as their testing instrument, which is geared to the U.S. 
curriculum and may thus introduce bias to the international testing. By contrast, the tests included here are not associated 
with the curriculum in any particular country, but have been devised in an international cooperative process between all 
participating countries. 
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tests can be established. The calibration of the dispersion of the tests relies on holding the score 
variance constant within a group of countries with stable education systems (defined in terms of 
secondary school attendance rates) over time. For this, Hanushek and WoBmann (in process) use the 
thirteen OECD countries who had half or more students completing upper secondary education around 
the beginning of international testing in the 1970s as the “stable” country group, and standardize 
variances to their group performance on the 2000 PISA tests. The details of the transformation are 
found in Hanushek and WoBmann (in process). 



Table A4.1: The International Student Achievement Tests 



Abbr. 


Study 


Year 


Subject 


Age“’^ 


Countries'^ 


1 FIMS 


First International Mathematics Study 


1964 


Math 


13,FS 


11 


2FISS 


First International Science Study 


1970-71 


Science 


10,14,FS 


14,16,16 


3 FIRS 


First International Reading Study 


1970-72 


Reading 


13 


12 


4 SIMS 


Second International Mathematics Study 


1980-82 


Math 


13,FS 


17,12 


5 SISS 


Second International Science Study 


1983-84 


Science 


10,13,FS 


15,17,13 


6 SIRS 


Second International Reading Study 


1990-91 


Reading 


9,13 


26,30 


7 TIMSS 


Third International Mathematics and Science 
Study 


1994-95 


Math/Science 


9(3h-4), 

13(7h-8),FS 


25,39,21 


8 TIMSS-Repeat 


TIMSS-Repeat 


1999 


Math/Science 


13(8) 


38 


9 PISA 2000/02 


Programme for International Student 
Assessment 


2000H-02 


Reading/ 

Math/Science 


15 


31-hIO 


10 PIRLS 


Progress in International Reading Literacy 
Study 


2001 


Reading 


9(4) 


34 


1 1 TIMSS 2003 


Trends in International Mathematics and 
Science Study 


2003 


Math/Science 


9(4), 13(8) 


24,45 


12 PISA 2003 


Programme for International Student 
Assessment 


2003 


Reading/ 

Math/Science 


15 


40 



Notes: “ Grade in parentheses where grade level was target population. FS = final year of secondary education. Number 
of participating countries that yielded internationally comparable performance data. 



To create our measure of quality of education employed in this study, we use a simple average of 
the transformed mathematics and science scores over all the available international tests in which a 
country participated, combining data from up to nine international testing occasions and thirty 
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individual test point observations.^*’ This procedure of averaging performance over a forty year period 
is meant to proxy the educational performance of the whole labor force, because the basic objective is 
not to measure the quality of students but to obtain an index of the quality of the workers in a country. 
If the quality of schools and skills of graduates are constant over time, this averaging is appropriate and 
uses the available information to obtain the most reliable estimate of quality. If on the other hand there 
is changing performance, this averaging will introduce measurement error of varying degrees over the 
sample of economic data (1960-2000). The analysis in Hanushek and WoBmann (in process) shows 
some variation over time, but there is no clear way to deal with this here. 

When looking at effects of the distribution of educational quality, we go beyond the mean 
performance of a country’s students and calculate the share of students above a certain test-score 
threshold, as well as the performance of students at different percentiles of a country, both based on our 
transformed metric of scores. This requires going into the details of the student- level micro data for 
each international test, which we can do for each test in mathematics and science with the exception of 
FIMS, where the micro data seem no longer accessible. See Hanushek and WoBmann (in process) for 
details. 

By now, there are a total of 77 countries that have ever participated in any of the nine international 
student achievement tests in mathematics and science. 50 of these are included in the analyses of 
economic growth. 25 countries are not included in the growth database due to lack of data on economic 
output or because they drop out of the sample for a standard exclusion criterion in growth analyses (15 
former communist countries, 3 countries for which oil production is the dominant industry, 2 small 
countries, 3 newly created countries, 2 further countries lacking early output data). Two countries 
(Nigeria and Botswana) turn out to be strong outliers in the growth regressions and are therefore 
dropped from the sample.^’ 



Note that the descriptive representation in Figure 4. 1 also reports performance on the reading tests, which are not 
used for the growth analyses here because they may be affected by testing in different languages and may not be easily 
combined into a common one-dimensional scale with mathematics and science tests. 

There are four countries with test-score data which have a few years missing at the beginning or end of the 1960- 
2000 period on the income data in the Penn World Tables. In particular, data for Tunisia starts in 1961 rather than 1960, and 
data for Cyprus (1996), Singapore (1996), and Taiwan (1998) end slightly earlier than in 2000. These countries were 
included in the growth regressions by estimating average annual growth over the available 36-39 year period rather than the 
whole 40 year period. 
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5. Where Does the Developing World Stand? 

Given the crucial importance of cognitive skills for economic development, it is telling to 
document how the developing countries fare in this regard. In the following, we document both the 
quantity of schooling and quality of education achieved by developing countries from an international 
perspective, and this vividly illustrates the magnitude of the task at hand. 

5.1 Lack of Quantity of Schooling 

The disadvantages of less developed countries in terms of educational enrollment and attainment 
have been well documented and are well known. To provide an aggregate picture. Figure 5.1 presents 
the share of children aged 15-19 who have completed grade 9, dropped out between grades 5 and 9 or 
between grades 1 and 5, or never enrolled, for eight regions of developing countries. The shares are 
unweighted averages of the available countries in the Demographic and Health Surveys (DHS) and 
similar surveys (cf. Filmer (2006)), conducted in a large number of developing countries. These 
household survey data are much more reliable than the administrative data on school enrollment rates 
traditionally used, because the administrative data available in many developing countries often 
overstate actual enrollment rates. Further, the self-reported data on the highest grade completed is more 
relevant for actual education than enrollment rates, which are affected by grade repetition and the like. 

While almost all OECD countries have universal school attainment to grade 9, essentially all 
developing regions are far from that. In the average African country in the data, only 13% of each 
cohort finishes grade 9, and less than 30% in Central America and South and East Asia. Even in South 
America, this figure is only 43%, although on the other hand only 17% of a cohort do not complete 
grade 5 (which often serves as an initial indication of basic literacy and numeracy rates). In West and 
Central Africa, 59% of each cohort do not even complete grade 5, and 44% never enroll in school in 
the first place. It is notable across countries, however, that the lack of grade 5 attainment is at least as 
often due to dropping out of school than due to never enrolling. In any event, while the pattern of 
educational attainment varies greatly across countries and regions, the lack of quantitative educational 
attainment from universal completion of basic education - be it grade 5 or grade 9 - is immense in the 
majority of developing countries. 



Grades 5 and 9 are chosen as two possible definitions of basic schooling. The age range 15-19 is chosen to balance 
the relevance of the data for most recent conditions against the censoring problem of some children still being in school. 
The basic patterns observed are the same when using estimates adjusted for the censoring (Pritchett (2004)). 
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Figure 5.1: Lack of Educational Attainment in Developing Countries 



East and South Africa 
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Note: Based on Pritchett (2004). 

Focusing on this dimension of schooling quantity, many policy initiatives of national governments 
and international development agencies have tried to increase the educational attainment of the 
population. The data in Figure 5.1 show that there remains a long way to go. But even this dire picture 
may understate the challenge that becomes apparent when quality is also considered. 

5.2 Lack of Quality of Education 

The description of school completion unfortunately ignores the level of cognitive skills actually 
acquired. Completing 5 or even 9 years of schooling in the average developing country does not 
necessarily mean that the students have become functionally literate in terms of basic cognitive skills. 
As a recent report by the World Bank Independent Evaluation Group (2006) documents, high priority 
was accorded to increasing enrollment in primary schools in developing countries over the past 15 
years, but much less attention was directed to whether children are learning adequately. In Figure 4.1 
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above, we have already documented the particularly low levels of mean performance of students 
attending school in basically all the developing countries that have participated in at least one of the 
international student achievement tests. But of course, mean performance can hide a lot of dispersion 
that exists within countries, and the prior analyses of growth (Section 4.4) show that there is separate 
information at different percentiles of the test-score data. 

Figures 5.2 and 5.3 depict the share of students in a country that surpasses the thresholds of 400 
and 600 test-score points on the transformed scale of the combined international tests - the same 
measure and thresholds that we used in the growth analyses of Section 4.4. Figure 5.2 shows the 
sample of 50 countries which forms the basis of our growth analyses, and Figure 5.3 shows the 
remaining 27 countries that participated in one of the international student achievement tests but lack 
internationally comparable GDP data for the 1960-2000 period that would allow them to be included in 
the growth analyses. 

When considering the basic educational achievement of students, we are interested in the share of 
students who surpass the threshold of 400 test-score points, which may be viewed as a rough threshold 
of basic literacy in mathematics and science. As is evident from the figures, this share varies 
immensely across countries. In countries such as Japan, the Netherlands, Korea, Taiwan, and Finland, 
less than 5% of tested students fall below this literacy threshold. By contrast, in many of the developing 
countries participating in the international achievement tests, more than half of the tested students do 
not reach this threshold of literacy. The countries with the largest shares of students who are 
functionally illiterate are Peru (82%), Saudi Arabia (67%), Brazil (66%), Morocco (66%), South Africa 
(65%), Botswana (63%), and Ghana (60%). In these countries, more than 60% of those in school do not 
reach a level of basic literacy in cognitive skills. It should be noted that the group of developing 
countries participating in the international tests is probably already a select sample from all developing 
countries, and, furthermore, the children enrolled in school at the different testing grades are probably 
only a select group of all children of the respective age in these countries. 
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Figure 5.2: Share of Students below 400 (“Illiterate”), between 400 and 600 and above 600 



Italy 



Spam 



Cypfus 

Pcftuyal 



Greece 



'/ni(yiay 

Turkey 



Philppir>es 

Indonesia 

Tifiisia 



Below 400 



Between 400 and 600 



Above 600 



Source: Hanushek and WoBmann (in process), based on several international tests; see text for details. 
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Figure 5.3: Share of Students below 400 (“Illiterate”), between 400 and 600 and above 600 
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Source: Hanushek and WoBmann (in process), based on several international tests; see text for details. 



5.3 The Size of the Task at Hand: Schooling Quantity and Educational Quality Combined 

We have seen that developing eountries are severely lacking in terms of both schooling quantity 
and quality of education. Figure 5.4 shows the combination of the two. For the 14 countries that both 
have reliable attainment data from the household surveys and have participated in the international 
student achievement tests, we combine educational attainment of 15-to 19-year-olds from the latest 
available year with test scores at the end of lower secondary education (eighth grade or 15-year-olds) 
from a year close by.^^ This allows us to calculate rough shares among recent cohorts of school-leaving 



Specifically, the years of the household survey data and the associated tests (where TIMSS always refers to the 
respective 8* grade sub-tests) are as follows: Albania and Peru: attainment data for 2000, combined with test scores from 
PISA 2002; Armenia: 2000 and TIMSS 2003; Brazil: 1996 and PISA 2000; Colombia: 2000 and TIMSS 1995; Egypt, 
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age of how many were never enrolled in school, how many dropped out of school by grade 5 and by 
grade 9, how many finished grade 9 with a test-score performance below 400 which signals functional 
illiteracy, and finally how many finished grade 9 with a test-score performance above 400. Only the 
last group can be viewed as having reached basic literacy in cognitive skills (cf. Pritchett (2004); 
WoBmann (2004)). 

As the results in Figure 5.4 reveal, in 11 of the 14 countries, the share of fully literate students in 
recent cohorts is less than one third. In Ghana, South Africa, and Brazil, only 5%, 7%, and 8% of a 
cohort, respectively, reach literacy. The remaining more than 90% of the population are illiterate: 
because they never enrolled in school; because they dropped out of school at the primary or early 
secondary level; or because even after completing lower secondary education, their grasp of basic 
cognitive skills was so low that they have to be viewed as fundamentally illiterate. In contrast, 55% of a 
cohort in Armenia and 63% in Moldova can be viewed as literate at the end of lower secondary 
schooling. 

An example of a basic test question from one of the international achievement tests can, perhaps 
better than anything, illustrate the scope of the problem in developing countries. One question asked to 
8* graders in TIMSS 2003 was: “Alice ran a race in 49.86 seconds. Betty ran the same race in 52.30 
seconds. How much longer did it take Betty to run the race than Alice? (a) 2.44 seconds (b) 2.54 
seconds (c) 3.56 seconds (d) 3.76 seconds.” While 88% of 8*-grade students in Singapore, 80% in 
Hungary, and 74% in the United States got the correct answer (a), only 19% of students in 8* grade in 
Saudi Arabia, 29% in South Africa, and 32% in Ghana got the correct answer (cf. Pritchett (2004) for a 
similar example). Random guessing would have yielded 25% correct on average. 



Ghana, and Morocco: 2003 and TIMSS 2003; Indonesia: 2002 and average of TIMSS 2003 and PISA 2003; Moldova: 2000 
and average of TIMSS 1999 and TIMSS 2003; Philippines: 2003 and average of TIMSS 1999 and TIMSS 2003; South 
Africa: 1999 and TIMSS 1999; Thailand: 2002 and PISA 2003; Turkey: 1998 and TIMSS 1999. 
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Figure 5.4: Types of Lack of Education in Selected Countries (15- 19-year-olds) 
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Note: Own calculations based on Filmer (2006) and micro data from different international student achievement tests. 
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When we combine data on quantitative educational attainment and qualitative achievement of 
cognitive skills for the countries with reliable data on both dimensions, it becomes apparent that the 
task at hand is truly staggering in most developing countries. In many developing countries, the share 
of any cohort that completes lower secondary education and passes at least a low benchmark of basic 
literacy in cognitive skills is below one person in ten. Thus, the education deficits in developing 
countries seem even larger than generally appreciated. Several additional references for examples of 
extremely low educational performance of children even after years of schooling from different 
developing countries are provided in Pritchett (2004). With this dismal state of the quantity and quality 
of education in most developing countries, the obvious remaining question is, what can be done? 
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6. Educational Spending and Student Outcomes 

The previous discussion has stressed the importance of cognitive skills for economic outcomes - 
individual earnings, income distribution, and economic growth. By virtually any standard, the evidence 
indicates a strong causal impact of skills on these outcomes. Yet simply knowing that skill differences 
are important does not provide a guide to policies that might promote more skills. Indeed, a wide 
variety of policies have been implemented within various countries without much evidence of success 
in either achievement or economic terms. This history has led some to believe that education is not a 
useful policy lever to achieve economic development goals. 

We believe that the disappointing results of the past generally reflect pursuing policies for which 
there is little empirical support. But the past outcomes should not be generalized to statements about all 
policies. 

Substantial research has gone into understanding why achievement differs across students and 
across countries. While controversies have existed about the interpretation of various evidence, some 
considerable agreement now exists about what kinds of approaches are unreliable avenues for change. 
There is perhaps less agreement on the precise approaches that might be followed, but even here there 
is a growing consensus about the general sets of policies that have shown promise based on more 
credible research approaches. 

The work on achievement determinants is generally labeled “education production functions.” The 
extensive work has taken a variety of perspectives and approaches. The general objective is to sort out 
the causal impacts of school factors (things that can potentially be manipulated through policy) from 
other influences on achievement including family background, peers, neighborhood influences, and the 
like. 

The most important generally available evidence relates to the effects of resources. Many policies 
undertaken involve substantial flows of resources - direct spending, changes in teacher salaries, 
reductions in class size, and the like - made within the context of current school organization. The 
empirical evidence clearly documents the difficulties with such policies. Simply providing more 
resources gives, according to the available evidence, little assurance that student performance will 
improve significantly. 

This section reviews the international evidence on resources, spending, and student outcomes. The 
next section discusses larger questions of institutional design. 
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6.1 Cross-country Evidence on Resources 

The easiest starting point is a simple picture of outcomes across countries. Figure 6.1 presents the 
international association between spending levels and math performance in the latest international test, 
the 2003 cycle of PISA conducted by the OECD. 



Figure 6.1: Expenditure per Student and Student Performance across Countries 

Association between average math performance in PISA 2003 and cumulative expenditure on educational institutions per 
student between age 6 and 15, in US dollars, converted by purchasing power parities. 
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Source: Organisation for Economic Co-operation and Development (2004), pp. 102 and 358; WoBmann (forthcoming-a). 



The dark line in Figure 6.1 represents the regression line for PISA scores on cumulative 
expenditure (age 6-15).®“* Taken literally, this line indicates that $60,000 per student in additional 
expenditure (a quadrupling of spending in the low spending countries) is associated with about a half 
standard deviation improvement in scores. But, this relationship is almost entirely due to the two 

^"^Organisation for Economic Co-operation and Development (2004), p. 101 uses this figure to conclude that there is a 
positive relationship between expenditure and student performance, because the correlation is weakly significant (in 
statistical terms) when considering all countries and ignoring other effects (corresponding to the weakly upward sloping 
regression line). 
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spending and performance outliers - Mexico and Greece. It is impossible to believe that the only 
difference between these two countries and the remainder is just their spending on schools. For 
example, there are four countries spending less than Greece but with their students significantly 
outperforming Greek students. Omitting Mexico and Greece, there is no relationship whatsoever 
between expenditure and performance (see the lighter regression line in Figure 6.1). On average, 
countries with high educational expenditure perform at the same level as countries with low 
educational expenditure. 

This picture is only the most recent demonstration that spending alone is not associated with 
student performance. This picture can be found with previous international student tests like TIMSS.“ 
As discussed previously, Hanushek and Kimko (2000) take into account other factors including 
parental education in their investigation of earlier test score differences but find no relationship with 
expenditure per pupil, expenditure as a fraction of GDP, or pupil-teacher ratios. Similarly, even when 
numerous family-background and school features are considered in cross-country student-level 
microeconometric regressions, these results hold.*’*’ 

Nor does the picture change when we look at changes in expenditure over time within individual 
countries. As depicted in Figure 6.2, an in-depth study of changes over time in educational expenditure 
and student performance has shown that educational expenditure per student has increased substantially 
in real terms in all considered OECD countries between the early 1970s and the mid-1990s.®’ Yet, 
comparing scores in 1970 and 1994/95 suggests that no substantial average performance improvements 
for students have occurred in these countries.®^ 



See, for example, WoBmann (2002), Section 3.2. 

“ See WoBmann (2001); WoBmann (2003a), pp. 1 17-170; Fuchs and WoBmann (forthcoming). 

For details of the analysis, see Gundlach, WoBmann, and Gmelin (2001), pp. C135-C147; and WoBmann (2002), 
Section 3.3. Tests in the two time periods are linked using U.S. performance data. Increases in educational expenditure are 
adjusted not only for average inflation, but also for the so-called “Baumol effect” of increasing costs in service sectors 
without productivity increases. Three different approaches of calculating price deflator for the schooling sector that account 
for this effect are averaged in the depiction of Figure 6.2. Not adjusting for this and using U.S. national data, real current 
expenditure per student in the United States increased by 2.4 percent each year on average over 1970-2003. 

One potential explanation for this bivariate pattern might of course be that students’ family backgrounds might have 
deteriorated on average. Today’s students may lack many of the basic capabilities required for a successful education and 
may thus be increasingly expensive to educate. Such effects may play a significant role in countries with a large inflow of 
immigrant students or with rising levels of poverty. But on average, parents in the considered countries enjoy higher 
incomes and are better educated than 25 years ago, and the number of children per family has declined. Hence children may 
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Figure 6.2: Expenditure per Student and Student Performance over Time 

Association between change in student performance (1994/95 relative to 1970) and change in educational expenditure 
(average annual rate of change of real educational expenditure per student between 1970 and 1994/95 in percent). 
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This evidence, while covering a wide range of countries, is again lightly represented in developing 
countries, because developing countries have not participated very frequently in the various 
international tests. This nonparticipation is itself an important policy issue. It is difficult to know what 
improvements are needed or whether any policy changes are having an impact without accurately 
measuring student performance. 

Comparisons across countries are nonetheless difficult in this area for a number of reasons. First, 
to compare spending, it is necessary to deal with exchange rate differences and with different earnings 

actually start schooling with better basic capabilities than ever before. Nonetheless, these issues are better dealt with directly 
through econometric analyses. 
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and spending levels. Second, because the schooling institutions can vary in a range of ways, it is 
difficult to ensure that other influences on achievement are properly accounted for in the analysis. For 
these reasons, we turn to data generated within separate countries. 

6.2 Within-country Evidence - Developed Countries 

For policy deliberations, information on resources from within individual countries is perhaps 
more appropriate than cross-country information. Most importantly, it generally allows for much more 
detailed analyses that can more reliably isolate factors subject to policy decisions. The open question, 
to which we return below, is whether information gleaned from the schools in one country translates 
into reliable advice in other countries. We begin by reviewing evidence from developed countries and 
move to developing countries in the next section. 

Researchers have studied the determinants of student achievement for more than 40 years. The 
work was begun in the United States in the “Coleman Report” (Coleman et al. (1966)), which 
introduced the idea of using statistical analysis to relate various inputs of schools to student outcomes. 
This work also underscored the importance of including nonschool factors by demonstrating that 
family differences were very important in explaining variations in achievement across students. While 
this original study has been subjected to considerable criticism, it led to an extensive line of research. 

The general picture about school resources in developed countries is now well known and has been 
reviewed in a variety of places (see Hanushek (2002b), Hanushek (2003), WoBmann (2005b)). Here we 
simply review the highlights and point to the key open issues. 

The available studies concentrate on various common inputs to schools such as teacher experience 
or class size. These factors are both readily available in both administrative and survey data sets and 
frequently identified as the focus of policy. The available econometric evidence now includes literally 
hundreds of separate estimates within the U.S. and other developed countries. Quite uniformly, 
however, there is little strong evidence that any of the following factors has a consistent impact on 
achievement: the level of teacher education, the pupil-teacher ratio, the characteristics of 
administration, or the facilities of the school. Specifically, aggregating results across studies, a minority 
of estimates are statistically different from zero (at the 5 percent level or better), and the studies do not 
even uniformly indicate improvements in performance with increased resources. 

The two factors that more frequently appear to positively influence student outcomes are teacher 
experience and measures of teacher achievement tests. We discuss these below under the assessment of 
the quality of studies. 



63 




A second line of studies focuses on financial inputs. A number of studies simply relate spending 
per student to achievement or capture teacher differences by teacher salaries. These studies also fail to 
show a consistent relationship between financial resources and achievement. 

These results have been controversial. A variety of debates have taken place around the correct 
interpretation of prior work (see, for example, Burtless (1996)). The most important line of debate has 
involved study quality and whether or not these works adequately control for various inputs that might 
complicate the interpretation of resources. For example, the statistical models may not adequately 
account for other inputs that affect achievement such as the quality of family inputs; the estimates 
might then erroneously attribute the higher achievement due to better family factors to some of the 
characteristics of schools. 

It is easiest to think of these problems as the same as those discussed under causality when looking 
at the aggregate economic outcomes. Take the simple example of the kinds of complications that could 
come from compensatory inputs to schooling. Specifically, consider a school in which some students 
are performing poorly and are falling behind the rest of the students in the school. A common policy 
might provide extra resources to these poor performers in an effort to help them catch up. For example, 
the school might provide smaller classes for the poor performers so that these students could get extra 
attention. In such a case, if the study did not adequately account for their prior performance, the 
statistical analysis might indicate that smaller classes were associated with lower performance - all 
because the performance of the students “caused” the class size to be smaller. 

A variety of approaches have been developed to circumvent problems such as these and to obtain 
estimates of the causal effects. One direct way has been to estimate statistical achievement models that 
consider “value added” of schools by introducing prior achievement levels of students into the 
analysis. When just the results of resource impacts from the more refined approaches are considered, 
there is little change in the overall picture. The common measures of school resources still do not have 
a systematic impact on performance. 

Alternative approaches beyond these statistical procedures have utilized specialized models of 
achievement, have introduced random assignment to different programs, or have capitalized on 
particular characteristics of schools in order to obtain better estimates of the impact of differing 
resources. A large part of this effort has been directed at uncovering the impacts of class size on student 
achievement. This effort reflects two aspects of class size: First, class size reduction is a popular and 



The details of these approaches can be seen in Hanushek (2002b). 
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frequently considered policy change; second, it is a very well-defined and discrete treatment that is 
amenable to various analytical approaches (as contrasted with vaguer concepts such as teacher quality). 

The extensive focused analysis on the causal effects of class size reduction unfortunately does not 
provide any clear indication that such policies consistently improve student outcomes. This work, again 
well- summarized elsewhere (Hanushek (2003), WoBmann (forthcoming-a)), sometimes indicates 
benefits of reductions and equally as frequently does not. From these analyses, general class size 
reductions do not appear to be worth their expense in terms of student outcomes. 

The interpretation of this work is important, because it also carries over to discussions of 
developing country policies. A simplistic view of this argument - convenient as a straw man in public 
debates - is that ‘money never matters.’ The research of course does not say that. Nor does it say that 
‘money cannot matter.’ It simply underscores the fact that there has historically been a set of decisions 
and incentives in schools that have blunted any impacts of added funds, leading to inconsistent 
outcomes. That is, more spending on schools has not led reliably to substantially better results. We 
return to these issues below. 

The most current research on school inputs and achievement has also led to another set of 
conclusions - that teacher quality is enormously important in determining student achievement. This 
work has concentrated on whether some teachers consistently produce more gains in student 
achievement than other teachers. Working with extensive panel data on individual students, these 
studies have confirmed large differences among teachers in terms of outcomes in the classroom. 

But, they have also shown that the observed differences are not closely related to common 
characteristics of teachers (such as amount of teacher education). Some attributes of teachers - such as 
having one or two years of experience - have explained part of the differences in teacher quality, but 
these factors are a small part of the overall variance in teacher results. This inability to identify specific 
teacher qualities makes it difficult to regulate or legislate having high quality teachers in classrooms. It 



More details on the class size analyses and debates can be found in Angrist and Lavy (1999), Hoxby (2000b), 
Krueger (2000), Hanushek (2002a), and WoBmann and West (2006). 

For the historical framing of the question, see the exchange between Greenwald, Hedges, and Laine (1996) and 
Hanushek (1996). 

See, for example, Hanushek (1971, 1992), Murnane (1975), Aaronson, Barrow, and Sander (2003), Rockoff (2004), 
Rivkin, Hanushek, and Kain (2005), Hanushek, Kain, O'Brien, and Rivkin (2005), Boyd et al. (2005), and Kane, Rockoff, 
and Staiger (2006). 
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also contributes to our conclusions below that changes in the institutional structure and incentives of 

7-1 

schools are fundamental to improving school outcomes. 

6.3 Within-country Evidence - Developing Countries 

The evidence on developed countries, while instructive to them, may or may not be similar in 
developing countries. Indeed, one concern about the within-country analyses is that they might not 
provide reliable guidance to other countries. For example, Heyneman and Loxley (1983) argued early 
in the analysis of school effects that schools may be considerably more important for achievement in 
developing countries, where family inputs to education may be less.^"^ Further, the levels of spending 
differ by a factor of 100 in comparing some developed and developing countries so that simple 
translations of results may be inappropriate. 

The research on schools in developing countries has been less extensive than that in developed 
countries. Moreover, the evidence is frequently weaker because of data or analytical problems with the 
underlying studies. Nonetheless, as Pritchett (2004) convincingly argues on the basis of ample 
evidence, just increasing spending within current education systems in developing countries is unlikely 
to improve students’ performance substantially. Overwhelming evidence shows that expansions on 
the input side, such as simple physical expansion of the educational facilities and increased spending 
per student, generally do not seem to lead to substantial increases in children’s competencies and 
learning achievement. The lack of substantial resource effects in general, and class-size effects in 
particular, has been found across the developing world, including Africa (see, e.g., Kremer (2003); 
Michaelowa (2001), Latin America (see, e.g., Mizala and Romaguera (2002); WoBmann and Fuchs 
(2005) and East Asia (see, e.g.,Gundlach and WoBmann (2001); WoBmann (2005a)). 

Again, it is necessary to understand the character of the results. In particular, the evidence refers 
most specifically to overall infusions of resources. They do not deny that some investments are 
productive. A number of studies provide convincing evidence that some minimal levels of key 
resources are frequently valuable in promoting student learning. For example, the simple availability of 



These conclusions are also underscored by the evidence on extensive teacher absenteeism in developing countries 
(see Chaudhury et al. (2006); Banerjee and Duflo (2006)). 

Based on recent international data from TIMSS, however, Hanushek and Luque (2003) question whether this overall 
conclusion holds. 

See Hanushek (1995), Glewwe (2002), Pritchett (2004), and Glewwe and Kremer (2006) for reviews of research on 
the determinants of educational quality in developing countries. 
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textbooks has been shown quite consistently to be important (Heyneman, Jamison, and Montenegro 
(1984), Harbison and Hanushek (1992)). Similarly, Duflo (2001) shows that school facilities can yield 
high returns. Nonetheless, while there are suggestive findings of positive resource effects scattered 
across the literature (see, for example, Lockheed and Hanushek (1988), the main message is still not 
one of broad, resource-based policy initiatives. The impact of these policies and programs, even if we 
presume that they could be replicated elsewhere, is limited. 

Glewwe and Kremer (2006) provide a review of the various problems associated with this work. 
Much of the evidence has come from observational evidence where the causal structure is not fully 
considered. Nonetheless, even among the relatively few studies passing their review criteria, the 
evidence on either specific or general resources remains mixed - and does not suggest any consistent 
resource strategies that might be applied. 

The general story that emerges from work in developing countries remains very similar to that for 
developed countries. Simply providing generally increased resources, or resources along the lines 
commonly suggested such as reducing class sizes or across-the-board increases in teacher salaries, is 
unlikely to lead to substantial changes in student performance. As in developed countries, it appears 
extraordinarily important to get the institutional structure right. 

6.4 Is There a Minimum Resource Requirement? 

The general lack of any systematic relationship between student achievement and resources raises 
the question of whether or not there is some minimum required level of resources even if impacts are 
not seen at higher levels of resources. This almost certainly is the case. It is consistent with the few 
“resource findings” previously indicated about the availability of textbooks, the importance of basic 
facilities, the impact of having teachers actually show up for class, and similar minimal aspects of a 
school. 

At the same time, there is little evidence upon which to build any serious notion of resource needs. 
One might suspect that resource effects would appear to be nonlinearly related to achievement such 
that, below some level, there is a strong relationship between added resources and student outcomes. 
There is little evidence supporting this kind of relationship. 

Part of the difficulty arises from the inference that teacher quality is the most important school 
element but that teacher quality is unrelated to common measures of salary, education, experience, 
certification, and the like. This implies that resources, at least as currently spent, are not effective at 
leading to generally improved teachers. Again, this is not a necessity, but instead merely an observation 
about how the current institutions translate resources into student results. 
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7. Schooling Institutions and Educational Quality 

The earlier evidence suggested that national economies were very dependent upon the quality of 
their economic institutions. It is hard to have a strongly growing economy without complementary 
institutions in the labor and product markets, without openness to trade and investments from the 
outside, and without effective systems of laws and property rights. 

Similarly, we suggest, it is difficult to have a highly functioning education system without a 
supportive institutional structure. On this matter, however, there are more different opinions and 
perhaps a wider divergence in outcomes. Part of the reason for the divergent opinions is simply a lack 
of sufficient experience, analysis, and evidence. 

This having been said, we see some clear general policies that are important. Foremost among 
these, the performance of a system is affected by the incentives that actors face. That is, if the actors in 
the education process are rewarded (extrinsically or intrinsically) for producing better student 
performance, and if they are penalized for not producing high performance, this will improve 
performance. The incentives to produce high-quality education, in turn, are created by the institutions 
of the education system - all the rules and regulations that (explicitly or implicitly) set rewards and 
penalties for the people involved in the education process. Therefore, one might expect that 
institutional features have important impacts on student learning. 

We highlight three institutional features that may be part of a successful system for providing 
students with cognitive skills: choice and competition; decentralization and autonomy of schools; and 
accountability. Given the nature of this study and the limited evidence, we are only able to present 
main ideas here; deeper analyses, in particular of issues of design and implementation in specific 
contexts, have to be left for more encompassing surveys and collections on the matter (e.g., Hanushek 
(1994); Peterson and West (2003); Betts and Loveless (2005); and, for developing-country contexts. 
World Bank (2004), eh. 7; Vegas (2005)), as well as for future research. 

7.1 Choice and Competition in Developing Countries 

Choice and competition in schools were proposed a half century ago by Friedman (1962). The 
simple idea is that parents, interested in the schooling outcomes of their children, will seek out 
productive schools. This demand side pressure will result in incentives for each school to produce an 
effective education system. These incentives will also put pressure on schools to ensure high quality 
staff in addition to a good curriculum. 
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In developed countries, a number of privately managed schools particularly in Europe provide 
alternatives for students. These schools, which also often have a religious affiliation, are part of the 
natural institutional framework. Unfortunately, little thorough evaluation has been done of the choice 
possibilities, in large part because there is no obvious comparison group (i.e., choice is instituted for an 
entire country and there is no example of the no-choice alternative). In a cross-country comparison, 
students in countries with a larger share of privately managed schools tend to perform better on average 
(cf. WoBmann (2005d, forthcoming-b)).^^ 

In the U.S., there are limited examples of private school choice, ranging from the publicly funded 
school vouchers in Milwaukee, Cleveland, and Washington, DC, to privately financed voucher 
alternatives. The evaluations of these generally show that the choice schools do at least as well as the 
regular public schools, if not better (see Rouse (1998), Howell and Peterson (2002)). 

Outside the US, Bradley and Taylor (2002) and Levacic (2004) find similar positive effects of 
school competition on the performance of English schools. Sandstrom and Bergstrom (2005) and 
Bjorklund, Edin, Ereriksson, and Krueger (2004) provide evidence on significant positive effects of 
competition from privately operated schools on the performance of public schools in Sweden. Eiler and 
Munich (2003) show that the introduction of a voucher-type system in the Czech Republic led to the 
creation of private schools in areas where public schools are doing badly and that the public schools 
facing private competition improved their performance in obtaining university admission for their 
graduates. 

The evidence from evaluations in developing countries is also generally consistent with the 
evidence from developed countries in terms of positive effects due to private competition. Eor example, 
Colombia ran a program that provided vouchers for the attendance of private schools. The benefits of 
this program have been found to clearly exceed its cost, which was similar to providing a place in 
public schools (Angrist et al. (2002); also King, Orazem, and Wohlgemuth (1999)). While evidence on 
the extensive voucher system in Chile is less uniform, the most elaborate studies tend to suggest that it 



Note that private school management does not mean private school funding; the international evidence suggests that 
both private school management and public school funding are associated with better performance across countries 
(WoBmann (2005d)). 

The largest U.S. voucher program in the State of Florida provides vouchers for special needs students (Hoxby and 
Murarka (2006)). While there is considerable satisfaction with this program, there is no evaluation available that is based on 
explicit outcome measures. 
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had positive effects on students’ performance.^^ Mizala, Romaguera, and Farren (2002) find that 
private fee-paying schools are the most technically efficient ones in Chile, followed by private 
subsidized and public schools. Similarly, Cox and Jimenez (1991) show that private-school students 
perform better both in Colombia and Tanzania. James, King, and Suryadi (1996) and Bedi and Garg 
(2000) find that privately managed schools in Indonesia are more efficient and effective than public 
schools. 

The major issue on choice and competition is still the limited experience. The teachers unions and 
administrator groups invariable dislike the idea of competition - because it puts pressure on them. 
Thus, not many examples of operational, large-scale attempts at competition have been evaluated. 
Nonetheless, the benefits of competition are so well documented in other spheres of activity that it is 
quite inconceivable that more competition would not be beneficial for schools. 

Choice and competition are very broad terms that can encompass many different programs. It 
seems clear that the specific design of any choice program will be important. The importance seems 
most clear in terms of distributional outcomes, because programs have the potential for leading to 
segregation of the school population in various ways that might not be desirable (cf. McEwan (2000)). 
On this, there is simply a need for greater experimentation and experience, given the current levels of 
uncertainty. 

7.2 Evidence on Autonomy of Schools 

Several institutional features of a school system can be grouped under the heading of autonomy or 
decentralization, including local decision making on different matters, fiscal decentralization, and 
parental involvement. Almost any system of improved incentives for schools depends upon having 
school personnel in individual schools and districts heavily involved in decision making. It is difficult 
to compile evidence on the impact of autonomy, because the degree of local decision making is most 
generally a decision for a country (or state) as a whole, leaving no comparison group within countries. 
Across countries, students tend to perform better in schools that have autonomy in personnel and day- 
to-day decisions (WoBmann (2003a, forthcoming-b)), in particular when there is accountability (see 
below). 



Cf., e.g., Mizala and Romaguera (2000), Sapelli and Vial (2002), Vegas (1999) and Hsieh and Urquiola (2006). A 
less positive interpretation of vouchers in Chile is found in McEwan and Carnoy (2000) and of vouchers in general in 
McEwan (2000). 
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The U.S. states have varying amounts of local autonomy. One systematic form of school autonomy 
is “charter schools,” which are public schools that are allowed to perform quite autonomously. (Note 
that these are actually hybrids of choice schools and public school autonomy, because they survive only 
if sufficient numbers of students are attracted to them and continue to attend them). These schools are 
relatively new, a fact that complicates evaluation since many are still in the start up phase. Nonetheless, 
while controversial, the evidence on them suggests that they perform at least as well if not better than 
the regular public schools after the initial start up phase (Hoxby and Rockoff (2004), Sass (2006), 
Bifulco and Ladd (2006), Hanushek, Kain, Rivkin, and Branch (forthcoming)). Also, given U.S. 
residential mobility, individual public school districts compete with each other, and more competition 
appears to produce better results (Hoxby (2000a)). 

There is evidence from within a few developing countries that supports positive effects of 
decentralization, school autonomy and community involvement. Focusing on parental involvement, 
Jimenez and Paqueo (1996) find that local financial contributions increased the productivity of public 
schools in the Philippines relative to central financing. Jimenez and Sawada (2001) show that enhanced 
community and local involvement improved student learning in El Salvador. Galiani and Schargrodsky 
(2002) show that decentralization in the Argentine secondary school system improved educational test 
performance. Vegas (1999) finds that teacher autonomy positively affects student outcomes in Chile 
when decision making authority is decentralized. Both Gertler, Rubio-Codina, and Patrinos (2006) and 
Alvarez, Moreno, and Patrinos (2006) find that decentralization of decision making to the local level in 
Mexico positively affects student outcomes, while the latter also find accountability to be very 
important in enhancing local decision making. Finally, Skoufias and Shapiro (2006) suggest for 
Mexico that the combination of increased school resources and local school management can produce 
small but statistically significant improvements in learning. 

Nonetheless, given the available evidence, support for autonomy also strongly rests on a 
conceptual basis. It is simply hard to imagine a system with strong incentives that does not capitalize 
on local decision making. 

7.3 School Accountability 

Many countries around the world have been moving toward increased accountability of local 
schools for student performance. The United Kingdom has developed an elaborate system of “league 
tables” designed to give parents full information about the performance of local schools. The United 

See Rothstein (2005) and Hoxby (2005) for further discussion of this evidence. 
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States has legislated a federal law that all states develop an accountability system that meets certain 
general guidelines. It also sets into law a series of actions required when a school fails to being 
sufficient numbers of students up to proficiency in core subjects. 

Evidence on the impacts of these systems has begun to accumulate. While there is some 
uncertainty given the newness of the overall accountability system (introduced in 2002), the best U.S. 
evidence indicates that strong accountability systems in fact lead to better student performance (Carnoy 
and Loeb (2002); Hanushek and Raymond (2005); Jacob (2005)). 

One institutional set-up that combines accountability with parental choice are systems which give 
students in schools that repeatedly do badly on the accountability test a voucher to attend private 

OA 

schools. In Florida, the threat of becoming subject to private-school choice if failing on the test has 
been shown to increase school performance particularly for disadvantaged students (West and Peterson 
(2006); Figlio and Rouse (2006)). 

Curriculum-based external exit exams are another means to introduce some form of accountability 
into the schooling system. They provide performance information which can hold both students and 
schools accountable. Students in countries with external exit exam systems tend to systematically 
outperform students in countries without such systems (Bishop (1997); Bishop (2006); WoBmann 
(2001, 2003a, forthcoming-b)). In the two national education systems where the existence of external 
exams varies within the country because some regions feature them and others not, Canada and 
Germany, it has similarly been shown that students perform better in regions with external exams 
(Bishop (1997); Jiirges, Schneider, and Biichel (2005)). 

Fittle evidence is currently available about accountability systems in developing countries. This 
reflects the generally weak accountability in these countries along with a general lack of systematic 
measurement and reporting of student achievement. However, Alvarez, Moreno, and Patrinos (2006) 
find a strong impact of accountability and local decision making in Mexico, and this remains even with 
consideration of the relative impact of teachers unions. 

It is difficult to imagine choice or autonomy working well without a good system of student testing 
and accountability. Thus, the ideas about institutional structure are closely linked together (cf. World 
Bank (2004)). The international evidence clearly suggests that school autonomy, in particular local 
autonomy over teacher salaries and course content, is only effective in school systems that have 
external exams in place (WoBmann (2005c, forthcoming-b); Fuchs and WoBmann (forthcoming)). One 

The legality of this system has been challenged in the Florida courts, so that the future of the program in Florida is in 

doubt. 
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example of this evidence is depicted in Figure 7.1, which plots relative student performance under the 
four conditions resulting from the presence and absence of central exams and school autonomy over 
teacher salaries, after controlling for dozens of student, family, and school background factors. The 
figure shows that school autonomy regarding teacher salaries is negatively associated with student 
performance in systems without central exams. In systems with central exams, student performance is 
generally higher than in systems without central exams, reflecting the increased accountability. In 
addition, the effect of school autonomy is turned completely around in systems with central exams: 
Salary autonomy of schools has positive effects on student performance in central-exam systems. This 
pattern of results has been found in TIMSS, TIMS S -Repeat, and PISA, and similar cases where 
external exams turn a negative autonomy effect around into a positive effect have been found for other 
decision making areas such as school autonomy in determining course content and teacher influence on 
resource funding. 



Figure 7.1: Accountability, Autonomy, and Student Performance across Countries 

Performance difference between the four categories relative to the lowest category, after controlling for numerous additional 
impact factors. 



Math performance 
in TIMSS/TIMSS-R 
test scores (relative 
to lowest category) 




Source: WoBmann (2005c). 
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Given the importance of high teacher quality, a promising candidate for improvement is the 
specific form of accountability that aims incentives directly at teachers. While convincing evidence on 
the effects of performance-related teacher pay is scarce, the more rigorous studies in terms of empirical 
identification tend to find a positive relationship between financial teacher incentives and student 
outcomes (cf. the survey in Atkinson et al. (2004) along with the discussions in Vegas (2005) and 
Vegas and Umansky (2005)). Thus, monetary incentives for teachers based on their students’ 
performance have been shown to improve student learning in Israel immensely (Lavy (2002, 2004)). 
Similarly, Atkinson et al. (2004) find that the introduction of performance-related pay had a substantial 
positive impact on student achievement in England. 

7.4 Summary of How to Improve the Quality of Education 

From the perspective of policies to obtain better outcomes, a simple picture emerges. First, the 
evidence from extensive research shows clearly that “pure resource policies” - i.e., policies that simply 
provide more resources within the current incentive structure of schools within most countries - are 
unlikely to produce substantial systematic gains in student outcomes. There are of course qualifications 
and caveats. Some schools will use added resources effectively. And, some systems that fail to provide 
minimal resources such as basic textbooks could, if resources were applied to the key shortage areas, 
get improvement. Yet, both developed and developing countries demonstrate an improbable but 
tangible disconnect between simple resource solutions and student achievement. 

Second, the evidence about what specifically counts in terms of schools is rather limited. But the 
central candidate is high teacher quality. Unfortunately, the characteristics of good teachers are not well 
understood, making regulatory solutions very difficult and supporting the development of rewards for 
high quality teaching. On this score, most countries - both developed and developing - have incentive 
systems built into teacher salaries and teacher contractual arrangements that resist significant change 
and that are not aligned well with teacher quality. 

Third, the key to improvement appears to lie in better incentives - incentives that will lead to 
management keyed to student performance and that will promote strong schools with high quality 
teachers. Here, three interrelated policies come to the forefront: promoting more competition, so that 
parental demand will create strong incentives to individual schools; autonomy in local decision making, 
so that individual schools and their leaders will take actions to promote student achievement; and, an 
accountability system that identifies good school performance and leads to rewards based on this. 

We have sketched three separate components of improved incentives, but it is also important to 
point out that they form a package. Focal autonomy without strong accountability may be worse than 
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doing nothing. Accountability without choice is likely to be watered down and made impotent by 
schools that would prefer no accountability. Choice without good information about performance has 
uncertain outcomes attached to it. In other words, these should not be thought of as isolated policies 
that can be independently introduced while retaining their advantages. 
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8. Conclusion 



This study was motivated by doubts that have been raised about the role of education and human 
capital in economic development. These doubts come from a variety of vantage points ranging from 
whether the research has correctly identified the impact of education to whether other institutional 
aspects of countries might be more important. They also encompass concerns about whether or not we 
really know how to change educational outcomes, particularly in developing countries. 

Our analysis has produced some remarkably simple but clear conclusions. 

1. Educational quality - measured by what people know - has powerful effects on individual 
earnings, on the distribution of income, and on economic growth. 

The accumulated evidence from analyses of economic outcomes is that the quality of education - 
measured on an outcome basis of cognitive skills - has powerful effects. Much of the earlier discussion 
has concentrated solely on school attainment, or the quantity of schooling. This focus is unfortunate, 
because it distorts analysis and the policy discussions. 

Individual earnings are systematically related to cognitive skills. The distribution of skills in 
society appears closely related to the distribution of income. And, perhaps most importantly, economic 
growth is strongly affected by the skills of workers. 

Other factors obviously also enter into growth and may well have stronger effects. For example, 
having well-functioning economic institutions such as established property rights, open labor and 
product markets, and participation in international markets have clear importance for economic 
development and may also magnify the benefits of quality education. Nonetheless, existing evidence 
suggests that quality of education independently affects economic outcomes even after allowing for 
these other factors. 

Moreover, the existing research provides strong reasons to believe that quality of education is 
causally related to economic outcomes. To be sure, quality may come from formal schools, from 
parents, or from other influences on students. But, a more skilled population - almost certainly 
including both a broadly educated population and a cadre of top performers - results in stronger 
economic performance for nations. 
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2. The current situation in developing countries is much worse than generally pictured on the 
basis just of school enrollment and attainment. 

Available measures of school attainment uniformly indicate that developing countries lag 
dramatically behind developed countries. This fact has driven a variety of efforts to expand schooling 
in developing countries, including the Education for All initiative. Yet, much of the discussion and 
much of the policy making has tended to downplay the issues of quality. 

International testing indicates that, even among those attaining lower secondary schooling, literacy 
rates (by international standards) are very low in many developing countries. By reasonable 
calculations, a range of countries has fewer than 10 percent of its youth currently reaching minimal 
literacy and numeracy levels, even when school attainment data look considerably better. 

Because of the previous findings - that knowledge rather than just time in school is what counts - 
policies must pay more attention to quality of schools. 

For developing countries, the sporadic or nonexistent assessment of student knowledge is an 
especially important issue - correcting this shortcoming should have the highest priority. It is 
impossible to develop effective policies without having a good understanding of which work and which 
do not. Currently available measures of program “quality” frequently rely upon various input measures 
that unfortunately are not systematically related to student learning. Moreover, the existing 
international tests - such as the PISA tests of the OECD - may not be best suited to provide accurate 
assessments of student performance in developing countries. The evolving capacity for adaptive testing 
that can adjust test content to the student’s ability level seems particularly important in the developing 
country context. Adaptive testing offers the possibility of meaningful within-country variation in scores 
along with the ability to link overall performance with global standards. 

3. Just providing added resources to schools is unlikely to be successful; improving the quality of 
schools will take structural changes in institutions. 

Even though attempts to improve quality have frustrated many policy makers around the world, we 
now have extensive research that reaches some clear conclusions. Research has delved deeply into the 
impact of adding resources within the current institutional structure (of both developed and developing 
countries). The overall finding is that simple resource policies - reducing class sizes, increasing teacher 
salaries, spending more on schools, and so forth - have little consistent impact on student performance 
when the overall institutional structure is not changed. 
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This does not say that spending never has an impact. In fact, there is reason to believe that some 
basic resources in the least developed schools, such as providing textbooks for all students, have a 
reliable impact. But, these situations have been documented just in the poorest schools and, even there, 
just in limited areas. There is also evidence that some schools use added resources better than others, 
although research does not characterize these different situations well enough to build them into overall 
resource policies. 

There is mounting evidence that the quality of teachers is the key ingredient to student 
performance. Unfortunately, the characteristics of good teachers are not described well, making it 
impossible to legislate or regulate good teachers. This motivates our institutional discussions below. 

The evidence also does not say that resources can never have an impact. In fact, the kinds of 
institutional changes that we identify are designed precisely to ensure that added resources are 
effective. 

The largest problem that we see in current school policy is the lack of incentives for improved 
student performance. Neither students nor school personnel in most countries of the world tend to be 
significantly rewarded for high performance. Without such incentives, it is no surprise to find that 
added resources do not consistently go toward improvement of student outcomes. 

Three sets of policies head our list for items needed to improve the overall incentives in schools: 
strong accountability systems that accurately measure student performance; local autonomy that allows 
schools to make appropriate educational choices; and choice and competition in schools so that parents 
can enter into determining the incentives that schools face. 

As noted, many if not most developing countries currently lack performance measurement that 
would allow them to know which policies were working and which were not or where performance was 
most in need of change. Lack of measurement of student outcomes clearly makes any system of direct 
rewards for success difficult if not impossible. An early step in any reform program should be 
instituting reliable school accountability systems. 

If schools are held responsible for results, they must have the ability to make decisions that will 
lead to better outcomes. Highly centralized regulatory systems simply cannot work effectively without 
broad knowledge of what programs are effective in different situations. Such knowledge is currently 
lacking, leading to centralized decision making that does not produce strong results. (The lack of 
results from centralized decision making can be readily seen in the data provided, particularly for 
developing countries.) 

Finally, in terms of overall changes, more choice and competition among schools leads parents to 
be directly involved in evaluating the performance of schools. While experiments in various choice 
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plans have been limited, the existing evidence suggests that they do tend to lead to better student 
outcomes. The best way to introduce choice in rural systems within resource-constrained developing 
countries is not fully known now, but it appears clear that the existing system is not working. 

Uncertainty about the best design of incentive programs for schools is most acute in developing 
countries, largely due to lack of relevant experiences. For this reason, it is especially important to 
implement a program of experimentation and evaluation - a key missing aspect of policy making in 
most developing countries. Education policy must be viewed from an evolutionary perspective, where 
on-going evaluation permits discarding policies that are ineffective while expanding those that are 
productive. 

When asking how education policies in developing countries can create the competencies and 
learning achievements required for their citizens to prosper in the future, the binding constraint seems 
to be institutional reforms, not resource expansions within the current institutional systems. For 
educational investments to translate into student learning, all the people involved in the education 
process have to face the right incentives that make them act in ways that advance student performance. 
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